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P. gingivalis antigens 



FIELD OF THE INVENTION 

The present invention relates to P. gingivalis nucleotide sequences, 
P. gingivalis polypeptides and probes for detection ofP. gingivalis. 

BACKGROUND OF THE INVENTION 

Periodontal diseases are bacterial-associated inflammatory diseases of 
the supporting tissues of the teeth and range from the relatively mild form of 
gingivitis, the non-specific, reversible inflammation of gingival tissue to the 
more aggressive forms of periodontitis which are characterised by the 
destruction of the tooth's supporting structures. Periodontitis is associated 
with a subgingival infection of a consortium of specific Gram-negative 
bacteria that leads to the destruction of the periodontium and is a major 
public health problem. One bacterium that has attracted considerable 
interest is P. gingivalis as the recovery of this microorganism from adult 
periodontitis lesions can be up to 50% of the subgingival anaerobically 
cultivable flora, whereas P. gingivalis is rarely recovered, and then in low 
numbers, from healthy sites. A proportional increase in the level of 
P. gingivalis in subgingival plaque has been associated with an increased 
severity of periodontitis and eradication of the microorganism from the 
cultivable subgingival microbial population is accompanied by resolution of 
the disease. The progression of periodontitis lesions in non-human primates 
has been demonstrated with the subgingival implantation of P. gingivalis. 
These findings in both animals and humans suggest a major role for 
P. gingivalis in the development of adult periodontitis. 

P. gingivalis is a black-pigmented, anaerobic, asaccharolytic, 
proteolytic Gram-negative rod that obtains energy from the metabolism of 
specific amino acids. The microorganism has an absolute growth 
requirement for iron, preferentially in the form of haeme or its Fe(III) 
oxidation product haemin and when grown under conditions of excess 
haemin is highly virulent in experimental animals. A number of virulence 
factors have been implicated in the pathogenicity of P. gingivalis including 
the capsule, adhesins, cytotoxins and extracellular hydrolytic enzymes. 



In order to develop an efficacious and safe vaccine to prevent 
P. gingivalis colonisation it is necessary to identify and produce antigens that 
are involved in virulence that have utility as immunogens to generate 
neutralising antibodies. Whilst it is possible to attempt to isolate antigens 
directly from cultures of P. gingivalis this is often difficult. For example as 
mentioned above, P. gingivalis is a strict anaerobe and can be difficult to 
isolate and grow. It is also known that, for a number of organisms, when 
cultured in vitro that many virulence genes are down regulated and the 
encoded proteins are no longer expressed. If conventional chemistry 
techniques were applied to purify vaccine candidates potentially important 
(protective) molecules may not be identified. With DNA sequencing, as the 
gene is present (but not transcribed) even when the organism is grown in 
vitro it can be identified, cloned and produced as a recombinant DNA 
protein. Similarly, a protective antigen or therapeutic target may be 
transiently expressed by the organism in vitro or produced in low levels 
making the identification of these molecules extremely difficult by 
conventional methods. 

With serological identification of therapeutic targets one is limited to 
those responses which are detectable using standard methods such as 
Western Blotting or ELISA. The limitation here is the both the level of 
response that is generated by the animal or human and determining whether 
this response is protective, damaging or irrelevant. No such limitation is 
present with a sequencing approach to the identification of potential 
therapeutic or prophylactic targets. 

It is also well known that P. gingivalis produces a range of broadly 
active proteases (University of Melbourne International Patent Application 
No PCT/AU 96/00673, US Patent Nos 5,475,097 and 5,523,390), which make 
the identification of intact proteins difficult because of their degradation by 
these proteases. 

SUMMARY OF THE INVENTION 

The present inventors have attempted to isolate P. gingivalis 
nucleotide sequences which can be used for recombinant production of 
P. gingivalis polypeptides and to develop nucleotide probes specific for 
P. gingivalis. The DNA sequences listed below have been selected from a 



large number of P. gingivalis sequences according to their indicative 
potential as vaccine candidates. This intuitive step involved comparison of 
the deduced protein sequence from the P. gingivalis DNA sequences to the 
known protein sequence databases. Some of the characteristics used to 
select useful vaccine candidates include; the expected cellular location, such 
as outer membrane proteins or secreted proteins, particular functional 
activities of similar proteins such as those with an enzymatic or proteolytic 
activity, proteins involved in essential metabolic pathways that when 
inactivated or blocked may be deleterious or lethal to the organism, proteins 
that might be expected to play a role in the pathogenesis of the organism 
eg. red cell lysis, cell agglutination or cell receptors and proteins which are 
paralogues to proteins with proven vaccine efficacy. DNA sequences that 
were considered to be poor vaccine candidates and not selected include 
those that code for proteins involved in replication, non-essential proteins 
involved in cellular processes and those proteins present at sites that would 
be unlikely to be affected by immune mediators such as those found in the 
bacterial cytoplasm or inner membranes. 

In a first aspect the present invention consists in an isolated 
P. gingivalis nucleotide sequence, the nucleotide sequence consisting of or 
including a sequence selected from the group consisting of SEQ ID NO: 1 
to 44, fragments thereof and sequences complementary thereto. 

In a second aspect the present invention consists in an isolated 
P. gingivalis polypeptide, the polypeptide being at least partially encoded by 
a nucleotide consisting of or including a sequence selected from the group 
consisting of SEQ ID NO: 1 to 44, fragments thereof and sequences 
complementary thereto. 

In a third aspect the present invention consists in an isolated 
P. gingivalis polypeptide consisting of or including an amino acid sequence 
selected from the group consisting of SEQ ID NO: 45 to 137 or fragments 
thereof. 

In a fourth aspect the present invention consists in an isolated 
polynucleotide, the polynucleotide encoding a polypeptide consisting of or 
including an amino acid sequence selected from the group consisting of SEQ 
ID NO: 45 to 137 or fragments thereof. 

In a fourth aspect the present invention consists in a nucleotide probe 
specific for P. gingivalis, the probe including a detectable label and a 



nucleotide sequence of at least 15 nucleotides, the nucleotide sequence 
being derived from a sequence selected from the group consisting SEQ ID 
NO: 1 to 44 and sequences complementary thereto. 

In a fifth aspect the present invention consists in a composition for 
use in raising an immune response in an animal directed against P. gingivalis 
the composition including an acceptable carrier and/or adjuvant and at least- 
one polypeptide having a sequence selected from the group consisting of 
SEQ ID NO: 45 to 137 or fragments thereof. 

As will be understood by those skilled in the art the nucleotides of the 
present invention may be useful in DNA vaccination to reduce the incidence 
and/or severity of P. gingivalis infection. 

Accordingly in a sixth aspect the present invention consists in a 
composition for use in inducing an immune response, the composition 
including at least one DNA molecule, the at least one DNA molecule having 
or including a sequence selected from the group consisting of SEQ ID NO: 1 
to 44, fragments thereof and sequences complementary thereto. 

Further information regarding DNA vaccination may be found in 
Donnelly et al, Journal of Immunological Methods 176(1994) 145-152, the 
disclosure of which is incorporated herein by reference. 

Throughout this specification, unless the context requires otherwise, 
the word "comprise", or variations such as "comprises" or "comprising", will 
be understood to imply the inclusion of a stated element or integer or group 
of elements or integers but not the exclusion of any other element or integer 
or group of elements or integers. 

DETAILED DESCRIPTION 

Preparation of the P. gingivalis library for sequencing. 

To determine the DNA sequence of P. gingivalis genomic DNA was 
isolated from P. gingivalis strain W50 (ATCC 53978) essentially by the 
method described by Mamur J. (1961)(1). Cloning of DNA fragments was 
performed essentially as described by Fleischmann et al., (1995)(2). Briefly, 
purified genomic DNA from P. gingivalis was nebulized to fragment the DNA 
and was treated with Bal31 nuclease to create blunt ends then run twice on 
preparative 1% agarose gels. DNA fragments of 1.6-2.0 kb were excised from 
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the gel and the DNA recovered. This DNA was then ligated to the vector 
pUCl8 {Smal digested and dephosphorylated; Pharmacia) and 
electrophoresed on a 1% agarose preparative gel. The fragment comprising 
linear vector plus one insert was excised, purified and this process repeated 
5 to reduce any vector without insert contamination. The recovered vector 
plus insert DNA was blunt-ended with T4 DNA polymerase, then a final 
ligation to produce circular DNA was performed. Aliquots of Epicurian Coli 
Electroporation-Competent Cells (Stratagene) were transformed with the 
library DNA and plated out on SOB agar antibiotic diffusion plates 
10 containing X-gal and incubated at 37°C overnight Colonies with inserts 
appeared white and those without inserts (vector alone) appeared blue. 
Plates were stored at 4°C until the white clones were picked and expanded 
for the extraction of plasmid DNA for sequencing. 

15 DNA sequencing 

Plasmid DNA was prepared by picking bacterial colonies into 1.5ml of 
LB, TB or SOB broth supplemented with 50-100ug/ml Ampicillin in 96 deep 
well plates. Plasmid DNA was isolated using the QIAprep Spin or QIAprep 
96 Turbo miniprep kits (QIAGEN GmbH, Germany). DNA was eluted into a 

20 96 well gridded array and stored at -20C. 

Sequencing reactions were performed using ABI PRISM Dye 
Terminator and ABI PRISM BIGDye Terminator Cycle Sequencing Ready 
Reaction kits with AmpliTaq DNA polymerase FS (PE Applied Biosystems, 
Foster City, CA) using the M13 Universal forward and reverse sequencing 

25 primers. Sequence reactions were conducted on either a Perkin-Elmer 

GeneAmp 9700 (PE Applied Biosystems) or Hybaid PCR Express (Hybaid, 
UK) thermal cyclers. Sequencing reactions were analysed on ABI PRISM 
377 DNA sequencers (PE Appled Biosystems). Raw trace data files from the 
ABI 377 sequencer were manually trimmed using Staden Pregap (Laboratory 

30 of Molecular Biology, Medical Research Council, UK) running on a Sun 
Microsystem computer. Trimmed files were assembled into contigs using 
Staden Gap v4.1 and exported as FastA consensus files. 

DNA sequence data was supplemented with sequence downloaded 
from the Internet site of the Institute for Genome Research 

3 5 ( http ://www, tigr . org) . 



DNA sequence analysis 



DNA files in FastA format were converted to GCG format files before 
translation into amino acid files using the program Flip obtained from 
ANGIS (Australian Genomic Information Service, University of Sydney, 
Australia). FastA homology searching and/or PSORT analysis(4,6) were 
used as data mining tool to select potential vaccine candidates. All proteins 
with a PSORT probability of outer membrane >0.8 were selected. The FastA 
results for these proteins were then examined for any alignment with a 
protein suggesting surface location of vaccine efficacy. 

All proteins were searched for homology against a non-redundant 
Bacterial protein database compiled byANGIS using the FastA algorithm. 
The settings used for the FastA searches were Ktup = 2, gap creation penalty 
= -12, gap extension penalty = -2, width for deriving alignment in opt = 16 
and the Blosum 50 scoring matrix. Individual FastA search results were 
examined for significant homology by statistical probability and amino acid 
alignments. 

The results are set out in Table 1. 

Protein files were then trimmed to the first, second and third 
methionine residues using a protein trimming program (ANGIS). The 
trimmed proteins were then subjected to PSORT analysis (4, 6) for the 
detection of signal sequences and the prediction of protein cell location. 

The results are set out in Table 2. 

The sequences obtained are set out below. The relationship between 
these sequences is set out in Table 3. The initiation codon was calculated 
using a combination of sequence homology alignment (FastA), signal 
sequence prediction (PSORT) or ORF prediction (GeneMark) (3). 

As will be understood by those skilled in the art open reading frames 
(ORFs) may be readily identified. ORFs may be determined using three 
methods, for example, alignments from FastA search results may be used to 
define the start and end positions of coding regions if sufficient protein 
homology is present. Alternatively, protein coding regions may be identified 
using the ORF identification program GeneMark (3) using a matrix trained 
on published P. gingivalis sequences. This matrix may be further refined by 
adding ORFs identified from the results of homology searching and ORFs 
identified by GeneMark. Otherwise, the program PSORT (4,6) may be used 
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for the detection of signal sequences at the start of a protein and the 
prediction of cell localisation. A UNIX version of TopPred (5) may also be 
used to identify potential membrane spanning domains. 

It will be .appreciated by persons skilled in the art that numerous 
5 variations and/or modifications may be made to the invention as shown in 
the specific embodiments without departing from the spirit or scope of the 
invention as broadly described. The present embodiments are, therefore, to 
be considered in all respects as illustrative and not restrictive. 

Dated this fourth day of August 1998 

CSL LIMITED 

Patent Attorneys for the Applicant: 
F B RICE & CO 
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Table 2 Results of protein analysis using Psort. 

The type of signal sequence and the amino acid number of the most C-terminal 
residue of the signal sequence is listed. The abbreviations for cell location used in 
the table are: OM = outer membrane, IM = inner membrane, PS = periplasmic 
space. 

The terminal amino acid may indicate surface location if it is an aromatic amino 
acid (F or Y). 

* indicates an uncleavable signal sequence is present. 
** indicates a lipoprotein signal sequence is present 



Seq ID# Signal present Cleavage 
(Y/N) site 



89 


Y 


20 


90, 91 


Y 


20/24 


92 


Y 


41 


93, 94 


Y 


21/20 


95 


Y 


28 


96 


Y 


21 


97 


Y 


32 


98 


Y 


19 


99 


Y 


27 


100 


Y 


20 


101 


Y 


21 


102 


Y 


20 


103 


Y* * 


19 


104 


Y 


24 


60 


Y 


20 


105 


Y 


28 


106 


Y 


26 


107 


Y 


20 



Cell location and probability Terminal 

amino 
acid 



OM 


IM 


PS 




0.94 


0 


0.31 


N 


0.82 


0 


0.19 


F 


0.93 


0 


0.25 


F 


0.85 


0 


0.44 


K 


0.93 


0 


0.25 


F 


0.94 


0 


0.29 


F 


0.93 


0 


0.26 


G 


0.93 


0 


0.22 


N 


0.94 


0 


0.32 


H 


0.94 


0 


0.3 


L 


0.93 


0 


0.24 


F 


0.93 


0 


0.24 


F 


0.79 


0.87 


0 


V 


0.94 


0 


0.32 


L 


0.93 


0 


0.21 


Q 


0.8 


0 


0.25 


Q 


0.94 


0 


0.3 


H 


0.93 


0 


0.19 


F 
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Table 2 (cont) 

Seq ID# Signal present Cleavage 
(Y/N) site 



108 


Y 


21* 


109 


Y 


24 


110 


Y 


57 


111 


Y 


47 


112 


Y 


22 


113 


Y 


28 


114 


Y 


18 


115 


Y 


23 


116 


Y 


25 


117, 118 


Y 


19/18 


119, 120 


Y 


25/23 


121 


Y 


21 


122 


Y 


19 


123 


Y 


36 


124 


Y 


20 


125 


Y 


20 


126 


Y 


21* 


127, 128, 129 


Y 


24/20/21 


130, 131 


Y 


25/24 


132 


Y 


29 


133 


Y 


46 


134 


Y 


24 


135 


Y 


23 


136 


Y 


25 


137 


Y 


17 



Cell location and probability Terminal 

amino 
acid 



OM 


IM 


PS 




0 


0.18 


0 


F 


0.93 


0 


0.2 


N 


0.93 


0 


0.21 


S 


0.89 


0 


0.21 


N 


0.94 


0 


0.32 


V 


0.94 


0 


0.38 


K 


0.93 


0 


0.25 


F 


0.85 


0 


0.30 


E 


0.88 


0 


0.35 


N 


0.89 


0 


0.25 


P 


0.93 


0 


0.25 


N 


0.94 


0 


0.3 


F 


0.85 


0 


0.3 


R 


0.93 


0 


0.27 


Y 


0.93 


o 


0.46 


L 


0.86 


0 


0.25 


Q 


0 


0.04 


0 


K 


0.53 


0 


0.2 


P 


0.91 


0 


0.31 


Y 


0.93 


0 


0.21 


F 


0.91 


0 


0.03 


H 


0.93 


0 


0.24 


F 


0.94 


0 


0.29 


D 


0.85 


0 


0.37 


R 


0.36 


0 


0.22 


D 
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Table 3: Relationship between DNA and protein SeqID numbers 
DNA sequence ORF ORF trimmed ORF trimmed ORF trimmed 
un trimmed to 1st Met to 2nd Met to 3rd Met 



1 


45 






2 


46 




89 


3 


47 


90 


91 


4 


48 






5 


49 


93 


94 


6 


50 


95 




7 


51 




96 


8 


52 


97 




9 


53 


98 




10 


54 


99 




11 


55 




100 


12 


56 


101 




13 


57 


102 




14 


58 


103 




15 


59 


104 




16 


60 






17 


61 


105 




18 


62 


106 




19 


63 


107 




20 


64 


108 




21 


65 


109 




22 


66 


110 




23 


67 


111 




24 


68 


112 




25 


69 




113 


26 


70 


114 




27 


71 


115 




28 


72 




116 


29 


73 




117 


30 


74 


119 


120 


31 


75 


121 




32 


76 




122 


33 


77 


123 




34 


78 


124 




35 


79 


125 




36 


80 


126 




37 


81 


127 


128 


38 


82 


130 


131 


39 


83 


132 




40 


84 


133 
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Table 3 (cont) 

DNA sequence ORF ORF trimmed ORF trimmed ORF trimmed 
untrimmed to 1st Met to 2nd Met to 3rd Met 

41 85 134 

42 86 135 

43 87 136 

44 88 137 
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(2) INFORMATION FOR SEQ ID NO : 1 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1768 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

10 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

15 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Porphyromonas gingivalis 

20 (ix) FEATURE: 

(A) NAME/ KEY : mis c_f eature 

(B) LOCATION 1. . . 1768 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 



30 



35 



agactatcaa 


atccatgaag 


gggcctatat 


cattccggtc 


aaagagcaaa 


tgcgaccact 


60 


attcaatggc 


aacaacaact 


tcagcatcta 


tgtagtggcc 


aatctcgatt 


tcaatgctcc 


120 


ggccacagaa 


gctgcgcttt 


ctcaatttgt 


ggtagagaaa 


tctattgaag 


tctcttctac 


180 


gacagcccct 


gccgatttcg 


taatgcttgc 


tcatggcaat 


aagcagatca 


atatggctac 


240 


gacagaaggg 


aaactgttgg 


gggattataa 


actcaaacga 


gtggcagcaa 


agattcgcat 


300 


gataaaaccc 


accatcaatg 


tgcaaggata 


tgaagtggtc 


ggaaatatac 


aggcaaagtt 


360 


tcgcaattcg 


gtaacgaagg 


ggttccttac 


cacagaagct 


caagagatcc 


cagctgctgc 


420 


atcctataag 


acatcggaat 


atcttgatat 


tgcagagtcg 


gcacctgcca 


attctatcca 


480 


tttctattct 


tactataaca 


aatggacact 


ctccacaccg 


gagaagcgac 


cggaattctt 


540 


catcatggtc 


aaattcaaaa 


agacaggaca 


gccggacaac 


acagccaaac 


cgtactacta 


600 


cagagtgccc 


ctcgaatctc 


aggacaatca 


ggtcaagagc 


aatgtcctct 


ataatctgaa 


660 



tgtgaaaatc gaaatcttgg gttctttaca agagccggaa gctgtttctg taaacggcac 
actcgcaata gaagaatgga ttctccatca ggatgcattc aatctgcctg ccaccaatta 
cttgatagtg gaacagcacg aaatcttcat gaataacgtg aacacatact cggtgaaata 
tcaaacttcg cagaaaccaa tcagcattag catacagtca gttaccttta gctacgtctc 
ttctgatggc actcagcaca atgatcttgt agcaagtagt agcgaccagt atcctacgat 
tacaagcgat aatacaagca tcataatcac ttccaagata ccggttaata acgtaccaaa 
gaagatcgtt tttgaggtaa ctaatggggt agccggtttg aaagagactg tcacagtact 
ccaatatcct gcacaattta ttgtcaatac acttggcaca gcatcggcat ggagaccaga 
cggatctttg gctccggggc ttaacaataa agcgatttac catgtcgtag tactggttcc 
acccgagaat ttatttgaag atgggacaca gacaatcatc ggttatcccc ccactgaaac 
aatttctttt cataagaaag agaacaatac ctatccgata gtatggtctg acacaaatac 
gacaaaacag gaccttgaga catcaagaat gatttcacct tcctttgagt tagcctccca 
acttggggct actctcccga tgccctatct cgagtattgg ccagggacat catatctcct 
tgactattcg ggaaactata ataataagag atacgccttg tttaattgcg ctttttactg 
ggagaaaaga aaagttaata acgaagaaat taaattcgat gactggcgtt tgccgacaga 
agctgagatc aaattgatag ataagctgca acataatgag cagagtgctg tccaagctat 
catgacaggg aattattatt gggatagtta ctctgcaaat gggtcttata aaatgcaagg 
aggagggggc caaggaaatt cctccaaagc ctatgttcgt tgcgtgcggg atgtgaaaaa 
gccgattcgt gacaagaagt caggtaag 

(2) INFORMATION FOR SEQ ID NO : 2 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2721 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D) TOPOLOGY : circular 

(ii) MOLECULE TYPE: DNA (genomic) 

( iii ) HYPOTHETICAL : NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Porphyromonas gingivals 

(ix) FEATURE: 

(A) NAME/ KEY : mis c_f eature 

(B) LOCATION 1...2721 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 



ACTATATGCG TTGCCTTCTT ATCCGCTCCC GTTGCGGCAT TGCGCGAGTC GCCGCCAATG 
GGGGCGGAGC GGAAGACTCC CTCTCTCCTA CCCCTTCTTT TCGAGCATTC TGATAAAGGA 
AAGGGGTTCG ACTACAGACT ATTCACAAGC AATAAACTTA AAGTGTTTGC CACAGGTAAC 
AGCCGATACA TACACAATAA ACCAACAATC AT C CAAG CAA TGAAACGAAT CGTTTTATCA 
TCTTTCCTGT TCGTTCTGTC CATACTTTCT TTGATGGCAC AGAACAATAC CCTCGATGTA 
CACATATCCG GTACGATCAA GGATGCCTCC TCCGGCGAAC CAGTGCCCTA TGCCACTGTA 
AGCATCCGGC TGACAGGAGC AGATACCACA CAGGTGTTCC GACAAGT GAC TGACGGCAAC 
GGCTACTTCG TCATAGGCCT GCCGGCAGCT CCCTCCTATC ACCT GACAGC TTCGTTCGTA 
GGTATGAAAA CCCATACCAT GCAGATTAGT CGGGGAAATG GACAGCACGA CATCAAATCC 
ATCGACATTT CTCTCGAATC CGAGGACAAA CAACTCTCCA CCGTCACCGT ATCGGCAGCA 
CGACCACTGG T GAAGATGGA GATAGACCGC CTGTCCTATA ATAT GAAAGA TGACCCCGCA 
GCCAAGACGA ACAACCTGCT CGAAATGCTG CGCAACGTTC CTTTGGTAAC GGTGGATGGT 
CAGGGCAATA TCCAGGTGAA AGGATCTTCC AACTTCAAAA TCCACCTCAA TGGCAGGCCC 780 
TCGACCATGG T GAGC AGCAA CCCGAAGGAG GTCTTTCGCT CCATTCCTGC CCATACGATC 84 0 

AAACGGGTGG AGGTCATCAC CGATCCGGGT GTAAAGTACG ATGCGGAAGG CACAAGTGCC 900 
ATCCTGGACA TCGTCACGGA AGAAGGTAAG AAGCTGGAAG GATATTCAGG- TTCCATCACG 960 
GCCAGTGTCA GCAACAATCC CACAGCCAAC GGTAGTAT CT TTCTGACGGC AAAGTCCGGC 1020 
AAAGTCGGGC TGACTACCAA CTATAAC TAC TACGGTGGCA AAAACAAGGG CTCTCGCTAC 1080 
TTTACCGAAC GTACT ACAT C CATGCTCCAA ACGATAGAAG AAGGCAAAGG GCAAGAAACC 1140 
TTTGGCGGAC ACTTCGGCAA TGCCCTCCTC TCATTCGAGA TAGATTCGCT CAATCTCTTT 1200 
ACGGTGGGCG GCAATGTACG CCTTTGGGAG ATGACCACCG ACCGGAACAG CGTAGAAAAA 1260 
AGCTTTGCCG GCAGCAACCT CATGTCCTAC ATAGACAGAA AACTCAAAAC ACAGAT GGAT 1320 
GCCGGATCAT AC GAGC T CAA TGCCGACTAT CAGCACAGCA CTCGCCTGCC GGGCGAATTG 1380 
CTCACCGTTT CCTACCGCTT CACTCACAAT C CTAATAAT A GCGAGACCTT CATTGACCAA 1440 
TGGAAGCGCG ATCCGCTCAA CACAGCTAAT ACGATCCAGT ACGCCGGCCA GCACTCCAAA 1500 
TCCGATGCGG GCATGGACGA ACATACGGCA CAAGTGGACT ATACACGTCC CTTAGGACAA 1560 
GCACATTCTT TGGAAGCAGG GCT GAAGTAC ATCTATCGTC ATGCCACGAG CGATCCTCTC 1620 
TATGAGATAC GACCATCCGA AGATGCTCCG TGGCAGCCCG GCTCTCTATA TGCACAGAAT 



60 
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720 



1680 
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CCGTCGAACG GAAAGTTCCG C C AC GAT C AA T AC AT C G GAG CAGCCTATGC CGGCTACAAC 174 0 

TATCGTAAGG AT C AGTATT C TTTGCAAACC GGCCTCCGAG TGGAAAGCAG CAGGCTGAAA 18 00 

GCACTCTTTC CCGAAAACGC AG C AG C AGAT TTCTCCCACA ACTCGTTCGA CTGGGTGCCA 18 60 

CAGCTCACGC TCGGCTATAC CCCCTCGCCC ATGAAGCAGC TTAAGCTGGC CTATAACTTC 
5 CGAATCCAAC GTCCTGCAAT CGGCCAACTG AATCCCTACC GGCTACAGAC CAACGATNAT 

CAAGTACAGT ATGGTAATCC CGACCTAAAG TCGGAGAAGC GTCACCACGT CGGTCTCTCC 2 04 0 

TATAATCAAT ACGGAGCCAA GGTCATGCTT ACAGCATCGC TCGACTACGA CTTCTGCAAC 2100 
AACGCCATCC AGAATTACAC CTTCTCCGAC CCGGCCAATC CCAATCTGTT CCACCAGACC 
TATGGCAATA TCGGACGAGA GCATTCTTTC AGCTTGAATA C CTAT G C CAT GTACACGCCG 
10 GCCGTATGGG TCAGGATTAT GCTCAACGGA AATATCGATC GCACATTCCA AAAGAGCGAA 
GCACTCGGCA TT GAT GT CAA TTCATGGTCC GGCATGGTAT ACTCAGGCCT GATGTTCACC 
CTGCCGAAGG ATTGGACTGT GAATCTCTTC GGAGGTTATT ATCATGGGGG AAGAAGCTAC 

CAGAC GAAGT AT GAT GG CAA TGTATTCAAC AATATCGGTA TAGCCAAACA GCTTTTCGAC 2 4 60 

AAAAAATTGA GAGTCTCGCT GAGCGCAAAC AACATT CAT G CGAAGTATTC GACAT GGAAG 2 520 

15 AGCCGGACCA TCGGCAATGG ATTTACTATT TATTCGGAAA ATGCCGGTAT ACAACGGAGT 258 0 

GTTTCCCTCA GCCTCACCTA CAGCTTCGGT AAGAT GAAT A CACAAGT GC G CAAGGTAGAG 2 64 0 

C GT AC GAT CG TCAACGACGA CCTCAAGCAA ACCTCATCCC AAGGACAGCA GGGTGGCGGA 2 7 00 
CAAGGAAATC CTACCGGCAA T 
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(2) INFORMATION FOR SEQ ID NO : 3 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1818 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 
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(ix) FEATURE: 

(A) NAME/ KEY : mis c_f eature 

(B) LOCATION 1. . . 1818 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 



ATAATCTTCT 


GT AC GAT T CA 


. TCACTCTGAG 


TTGGAAATAA 


TGAACAGCAT 


CATGAAATAT 


60 


CAATTATATA 


, CGGCCGTCAT 


AATGGCTCTC 


TCTGTATCAT 


CCGTTTGCGG 


TCAAACCCCA 


120 


C GAAAT AC AG 


AAAC CAAAC G 


CCCCGACACG 


CTGCGCAGGG 


AGCTTACTAT 


CGTTAATGAC 


180 


CAGACTGTGG 


AGATGGAGCA 


TGCGGATCCG 


CTTCCGGCTG 


CAT AC AAGG C 


CAT C GAAC CT 


240 


C GATT AAAAC 


CTTTCCGTCC 


GGAATATAAC 


AAGCGTACAT 


TCGGATTTGT 


CCCTGAAGTT 


300 


TCCTCTTCAG 


GCAG GAACAA 


TCTTCCGAAT 


ATCCTGCCGA 


CGGAAGGTCA 


TAT GAAGC AC 


360 


CGGGGGTACC 


TGAATATCGG 


TAT C GGC CAT 


ACGCTAAACC 


AGCGAATGGA 


TGCCGGCTAT 


420 


C GT C T GAT AG 


ATGCAGAGCA 


GGAGAGACTG 


AATCTTTTCC 


TCTCCTATCG 


TGGGATGAAA 


480 


TCGGCTTTCA 


AT AC C G GT GA 


CTTCGACGGC 


GACAGAAAGG 


AT AGAC GAAT 


GAT G G C AG GA 


540 


GTGGACTACG 


AGCAGCGCAG 


GCCTTCCTTT 


GTGCTTGCTA 


CCGGCTTGTA 


TTATT C GAAC 


600 


CAT T ATTT C A 


ATAACTACGG 


ACGGGGAGCT 


ACCACCAATG 


TGGGCAGCAT 


CCCTCAGCTA 


660 


TCGACACCTG 


TTACTCCTCA 


GAT GGACAAC 


G G G AC C C AC A 


ACGTCCGTGT 


ATACTTGGGT 


720 


GCAAAAAATG 


AT GT GAT C GA 


TGCCAGGATC 


GACTATCGTT 


TCTTCCGTTC 


TATTCCCTAT 


780 


CTGGGTACCG 


AT C C GAT G AA 


GGCTCTCACA 


G AAC AT AC G C 


CTGAACTGAA 


CGTGACGATG 


840 


AGTAAT GAGT 


TGTCCGATGA 


TATTAAGCTC 


GGTGTCGAAG 


TTCGTACGGG 


AGGATTGTTT 


900 


TTTGCCAAAA 


AC AG C GAAAT 


GATT CAAAC G 


GGCGTTCTGT 


CCGAAACCGA 


CCGCAACCTG 


960 


TATTATGTGG 


AGGGCGCGCC 


CACAATCGGA 


TTTGTCGGAG 


ACTCGGACAA- 


TATGCAATGG 


1020 


AACATACAGG 


CCGGAGTAGG 


GATTTCTTCC 


CATTTCGGAG 


C CAAAG GGAG 


GTTGTTTTTC 


1080 


TGGCCTAAAC 


TGGATGCTTC 


GCTTAGTATC 


TTCCCTTCAT 


GGCGTGTGTA 


TGCGAAAGCC 


1140 


TTCGGCGGTG 


TGATTCGAAA 


TGGTCTCGCC 


GATGTTATGC 


AAGAGGAGAT 


GCCCTACCTG 


1200 


ATGCCCAATA 


C GATT GTACT 


CCCTTCGCGC 


AATGCTTTGA 


CCGCCCAATT 


AGGGGTGAAG 


1260 


GGGAATATAG 


CCGATGTGGT 


ACGTAT GGAG 


GTTTATGGCG 


ACTTCTCCAA 


GCTGACAGGT 


1320 


GTGCCTTTCT 


ATACTCCGAC 


TCTACCCTTA 


TATAATCCAT 


CCGACTTGTA 


TCAGTATAAT 


1380 


GTGAGTTTCT 


TGCCGATATA 


TGCCGACGGC 


AGCCGCTGGC 


GCGCAGGTGG 


TAAGCTGGAA 


1440 


TACTCTTATC 


GCGATATGCT 


CCGCTTTCTG 


GT AGAC G CAT 


CCTATGGCAA 


GTGGAATTTG 


1500 


GATGGAGGAC 


TTGTCGCCTC 


CATGCAGCCC 


GATCTTATAT 


TGAAGGCAGA 


AGTAGGTGTT 


1560 


CATCCCATTG 


CCCCATTGGA 


TGTCAGACTC 


C GGTAT AC AC 


AGCTGAACGG 


ACGGTATCGG 


1620 


TATTCTTTCG 


GCTCGGCTGG 


CTCGGAAGCC 


TTGGGTATCG 


GTAATGTACA 


TCTTCTTAGT 


1680 


GCGGATGTTT 


CATACAAGCT 


GAAAAAGAAC 


TTGAGCCTTT 


ATCTCAAAAT 


CGATAATATG 


1740 



4 



21 



CTGGCGGAAA CGACAGAGCT TATCGGTTAT TATCCTATGC AGCCGTTCCA TTGTTTCGCC 18 00 



GGTTTTAGCT GGACTTTC 

5 (2) INFORMATION FOR SEQ ID NO : 4 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1071 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
15 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: mis c_f eature 

(B) LOCATION 1...1071 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 



1818 



AACTCCCATG CAACAGTCTC CGTTATATGT TCAATGATGG AAAAAT G TAT CTTTGCTCAC 60 

TATCCACATA ACCTTGTGTT CAT GATT C G C AAGCATTTCG GTATCATTTT GGGATTTCTT 12 0 

TCTCTTGTGT TTTCGGCAGG TGCTCAACAA GAGAAGCAGG TGTTTCATTT TCTGAACCTT 18 0 

CCGGCTACTG CACAGGCTTT GGCTGCCGGA GGCAAAGCTA TCACCATCGT AGACGACAAT 24 0 

CCCGGACTGG CTTTTGAGAA TCCGGCTCTG CTCGGATATG AATCCGGTGG CCGCGCCTTT 300 

CTTTCCTATT TATATTATAT GAGTGGTTCG CATATGGGCA ATGCCTGTTA TGCCTCGTCC 360 

GTCGGAGAGC GTGGCATGTG GGGTGTTGGC ATGCGTTTCC TGAACTACGG GTCTATGCAA 42 0 
35 GGAT AC GAT C AGAATGCGAT TGCCACCGGC TCTTTTAGTG CTTCGGATAT AGCTGTACAA 



480 



GGATTTTACA GC CAT GAACT GAGCAACCAC TTCCGCGGTG GAGTCAGCCT AAAAGCATTG 54 0 



22 



TATTCTTCTA TCGAGACGTA TAGTTCCTTT GGCCTTGGTG TGGATGTCGG TAT CAGTTAT 
TACGACGATG AC AAAG GAT A TTCCGCTTCC GCTCTGTTCA AGAAC GTAGG GGCGCAACTG 
AAAGGCTATA AT GAAGAAC G GGAACCGCTC GATTGGGATT TCCAGCTCGG CTTTTCCCGC 
AGTTTTATCA ATGCTCCGTT TCGCTTGCAC ATCACGTTGT TCAATCTGAA TCCGCACTAT 
TTCAAGCGTC T T GT AC C AC G CGATCTGTCC AAGAT G C AAA AGTTCCTCCG ACACTTCTCG 
ATAG GAG C AG AATTTACTCC TTCCGAGAGG TTTTGGGTCG GGCTGGGATA TACGCCACAG 
AT T GCACAG G ATTTCGAGGT GGAAGGCGGC AACAAATGGG GAGGTCTTTC GGCCGGCGTC 
GGTTTCACTT CAGGTGTAGT ACGTGTAGGC GTATCTGCTG CCACCTATCA TCCTGCAGCT 
CTTTCGTTCA TGTGTTCGGT AGGTATCCGT TTGGACGATA AGAGCATCTT C 

(2) INFORMATION FOR SEQ ID NO: 5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 693 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

( iii ) HYPOTHETICAL : NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1. . . 693 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 

AAGTTTATGA TGAAAAAAGC ATTTGTTTTC GTACTACTGG TTTGCCTATT CTCCTCGTTC 
AGCAGTTCCG CCCAAACAAC GACGAACAGT AGCCGGAGTT ATTTTACAGG ACGAATCGAG 



23 



AAGGTGAGTT TGAACTTAGG GGTCCCCCCC GTAAGCACAG AGGTTTGGGG AAT G AC C CAT 18 0 

GATGCGAACG GTCTCCCTTT CGAAATACCT ATCTCTTTCA GTCGTTTCAA CAGCCAGGGA 240 

GATATAGCTA CCACTTATTA CATAGCGAAT AGCGAGGCAA CTTTGAATGA ATGGTGCGAC 300 

TAT G C AC AC C CGGGCGGCAT CGTGAGGGTA GAAGGTCGTT TTTGGAAAAT GACTTACAAC 360 

5 AT AC CAAC CT ACAAT GCAGT CTGCACCCGG ATTACATTCG AAAATCAAGA AATAGAAGGA 42 0 

ACGATCGTCT T GAT AC C CAA GCCCAAAGTC TCGCTGCCTC ATGTGTCGGA ATCGGTGCCT 480 

TGCATCCGAA CCGAAGCCGG GAGGGAATTT ATCCTTTGCG AAGAAG AC GA CACCTTTGTG 54 0 

TCTCACGATG GTAAC GAAGT AAC GAT AG GC GGTAAACCTT TCTTGCTCAA T AC CAAC GT A 600 

AAGATTGTGG GGGACGTATC TCAAAAGTAT GCCGTGGGGG TAGGAGAAAT TCGATTCCTG 660 

10 CAGATTT GT G CCCAAACAGT AT C AC AAC AA AAA 693 



(2) INFORMATION FOR SEQ ID NO : 6 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1698 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

20 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

25 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



30 (ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1...1698 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6 

35 

TACGACGGAG CGAGACTGGT CTATACCCTA TTTCGGAATA GAAACGACAT TCACCCTATG 



60 



24 



10 



25 



AAAACCATTG TAAGATACAG CCGCCTTCCG GTCGCTCTCT TCTTTTGCCT TTTGGGAGCT 
GTGCACTTGT CTGTCGAGGC GCAGAT GCT C AATACTCCTT TCGAGCTGTC GGATCAGATC 
GTCCTGTCTC C C AC C GAAAG GCAATACAGG GAGATTTGTG TGCAAACGAA AGAAAAAAGG 
GGGGCCGATC TTTTCCCGTT GAGC GATAAG CTGCGCGATT CGGCCTATGT TCGTTTCGGC 
TCGGCCTATG GCGATATTGC GGGCGACTAT CTTCCGTACA ACGGCAATAA CTACTCCTCG 
CTCTCGCTCG AATCGGGTGG TCGCATCAGT GTCCGTAACT ATGGCACATT GCAGGGCAGT 
GCTTCCTACT CACGTGGCAT GCACAAACGC ATCGGCTGGA ATGCTCTGCG CAACGCCGAA 
GCCTACTATC CCTATTTGGT GTCCGATTCG ACCGGCGGAG ACT AT CAT TT C GAAGACT AT 
CGGCTTGCCG GCTACTATTC TTTTCGCGCC GGCCGCTTGC CCCTCGGTAT AGGCTTCTCA 
TACAGGGGCG AAGTTGCTTA TCGGCTGACC GATCCGCGTA C GAC CAATAC GACCGGTGCA 
TTGGAGCTTT CTTGTGCTAC CTCTTTGACG CTGCCTCGAG AGAACAG GC T ATCGCTTTCG 
GCTGCGTATC TC TAT CAT AG ACAACACCTC ACACAGTACA ACTGGCGTCC CGGGCAGCAG 
GACAAATTCT TCGTCAGCTA CGGTTTCGGT CAGGTGGATG TCAGCAACAG CCCTATCTGG 
TTCGGTATCT CCAGAATGAA CTACGTCAAC G GAT GGAAG C TTAGCTCCCG TCTGGATACC 
15 CGTAGGGGCG ATGCCATCGG TCTCGACTAC AGCGGCTACT TCCTCGATAC CGAaGAGAGG 
TCGTCCATCA ATCTCTTTGC TTTGCTTTAC AATCGCCTGC GACTCTATGG TAGCTGGCAT 
CTGTCGGACT TCGATTTTTC ATTTTCAGCC GACTATGCTC TGCGCCAAGG GATAGAGCGG 
ATATACGAAG ACTACAAGCC GGAT GATAAT TATCATATCT ACGACCTCCG TATCTTGGCC 
ATTCGCCGCT GGTATATGCT CAATGAGTTT TCTGCCCAAG CCCAAGCCTC CTACCGTATT 
CGCACGGATA GAGGTTGTGC CCTGAGAGTG AGTGCCGGTA GTGATTTCTA CGGCTATGAT 
GAGAC GT AT C GCAAGCATGG AC AT CAT AC C AT GAGC GGAA TGCTACGTCC TTTTGCCGGT 
ATAGCCTATG ACCATGCCGG AT C CAAATT G GATTTTGGAC TTTCGCTTTC GGCTGCTTAT 
CGAATGGTGC TGACGCATTC GTATAAGATT CGTACCATCC AGAAAGAGCA GCTCGACTAT 1440 
CAGCTGGCCT ATTTGCCCTA TGCCTATCGT AATAGAGAAG GCGTGGAGGT- GCGTTCCTCT 1500 
CTGTACGTCT CGATTCCGAT GCAGAATACC CACCGCCTGA T GACAGAGCT GCGGTTGTAT 1560 
GGCGACCTGA TGAAAAGAAA GGACGGTATA GCCTATGGCA AAACGCCCGG TGTCATCTCA 162 0 

CATATCCTGT CCGATCCGCA AGCCGAACGA ACGTCCGGCC AT AC CAT C GG GGCTATCTGC 1680 
AATATCTCCT ACCTCTTC 
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(2) INFORMATION FOR SEQ ID NO : 7 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2457 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1. . .2457 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 

AGGACGAACG TTTTCTTATC TTTGTCCCAT AAAATTGGGA GAAGGGGTGC TTCCTGTAGC 
AATCGGAATG CGTGGATGGC T GAGAACAAA CCCTCATCAC CTGAACCGGA TAATACCGGC 
GTAGGAAACT CTCCGTCTGA CTATCTTCAC GGCGAAGCAA TCATTCCCCC TCTCTCTTCT 
TTGTCCAACT TCAATGATAA GAGATTTATG AAAAAACTTC ACATGATTGC CGCCTTAGCC 
GTCCTGCCTT TCTGCCTGAC GGCACAAGCA CCCGTCTCCA AC AGC GAGAT AGATAGT CTT 
AGCAATGTGC AGCTCCAGAC CGTACAGGTC GTAGCTACTC GCGCCACGGC GAAAACCCCT 
GTCGCTTACA CCAACGTTCG CAAGGCCGAA CTTTCCAAGT CCAATTATGG- TCGTGACATC 
CCCTATCTGC TGATGCTGAC TCCCTCCGTG GTAGCCACCA GCGATGCCGG TACGGGTATC 
GGATATTCCG GCTTTCGCGT GCGTGGCACC GATGCCAATC GCATCAACAT AACT AC CAAT 
GGAGTACCCC TCAACGACTC CGAATCTCAG TCCGTCTTTT GGGT GAATAT GCCCGACTTC 
GCCTCTTCCA TCGAAGACCT TCAGGTGCAG CGAGGTGTGG GTACTTCCAC CAATGGTGCC 
GGAGCTTTTG GGGCAAGTGT CAAT AT GC GT AC GGATAAT T TGGGACTGGC TCCTTATGGC 
CGTGTCGATT TGAGCGGAGG TTCGTTCGGC ACATTCCGCC GATCGGTCAA ACTCGGTAGC 
GGACGCATCG GTCGCCATTG GGCAGTGGAT GCCCGCCTGT CCAAAATCGG TTCGGACGGC 
T AC GT G GAT A GAGGAAGCGT GGATCTGAAA TCCTATTTCG CACAGGTGGG CTATTTCGGT 
AGCAACACGG CTCTCAGGTT CATCACTTTC GGAG GAAAAG AAGTTACGGG TATCGCATGG 
AACGGTCTTT CCAAGGAGGA TGAAGCCAAA TATGGCCGCC GATACAACAG TGCCGGTCTT 
ATGTACGTGG ACGCGCAAGG AGTAC C GCAC TACTACCACA AT AC C G AC AA TTACGAGCAG 
CGTCACTACC AT GC CAT CAT GAC GCACAG C TTCTCTCCTT CCGTTATCCT CAACCTCACG 
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GCACACTACA 


CGGCCGGATA 


TGGCTATACG 


GACGAATATC 


GTACCGGACG 


TAAACTAAAG 


1200 


GAATATGCAC 


TGCAGCCCTA 


TGTGGAAAAC 


AGTGTGACCG 


TGAAGAAAAC 


GGATCTCATC 


1260 


CGTCAGAAGT 


ATCTGGACAA 


TGACTTCGGA 


GGACTCATCG 


GTTCGCTTAA 


CTGGCACACC 


1320 


GGTGCATGGG 


ATTTGCAGTT 


CGGGGCCTCG 


GGCAATATCT 


ATAAAGGAGA 


CCACTTCGGC 


1380 


CGTATCACTT 


AC AT CAAAAA 


GTACAATCAG 


CCCTTAGCTC 


CCGACTTCGA 


AT ATT AT C GG 


1440 


AACAGGGCAG 


ACAAAAGAGA 


AGGTGCAGCC 


TTTGCCAAAG 


CCAACTGGCA 


GATCACTCCG 


1500 


GAACTGAACA 


TGTATGCCGA 


CCTCCAGTAT 


C GT AC CAT C G 


G CTACAC GAT 


AAACGGCATC 


1560 


ACGGACGAAT 


AT GAT GAG GT 


ACAGGGAAGT 


AT GCAG CACA 


TCGATTTGGA 


CAAGACCTTC 


1620 


CGCTTCCTCA 


ATCCGAAGGC 


CGGTCTTACC 


TATAGTTTCG 


AC GAT G CT C A 


T AC T G C CT AT 


1680 


GCTTCTGTTG 


CGGTAGCACA 


CCGCGAGCCT 


AACAGAACCA 


ATTACACCGA 


AG C C GGAATA 


1740 


GGACAGTATC 


CTACGCCTGA 


GCGACTGATC 


GACTATGAGC 


TGGGCTACCG 


CTATGCTTCG 


1800 


CCCCTCTTGT 


CGGCCGGAGT 


AGGTCTCTAT 


TATATGCAAT 


ACAAGGACCA 


ACTCGTGCTG 


1860 


GATGGCCGTT 


TGAGCGATGT 


GGGACAGATG 


CT CACAAGCA 


ACGTCCCCGA 


C AG C T AC C GT 


1920 


ATGGGACTGG 


AGCTGACTCT 


CGGTTGGCAG 


ATCCTTCCTC 


GTTTGCTGCG 


TTGGGATGCT 


1980 


TCTTTCACTA 


TGAGTCGCAA 


CAAAATCGAC 


CGCTACGTAC 


AAT AT AC AT C 


CGTATATGAT 


2 04 0 


GCGGACTACA 


ACTGGCTCGA 


ACTCAAGGAG 


GAGACCCTCG 


AAAGCACGGA 


TATAGCCTAC 


2100 


TCGCCCAATG 


TCATTGCCGG 


CAGCATGCTT 


ACCCTCTCTC 


ATGCCGGTTT 


CGAAATGGCT 


2160 


TGGACGAGCC 


GCTTCGTCAG 


CAAGCAATAT 


CTGGACAATA 


CACAGCGCAG 


CGATCGCATG 


2220 


CTTTCCTCCT 


ATTGGGTGAA 


CGACCTCCGC 


CTCGGCTATG 


TGCTGCCGGT 


TCACTTCGTT 


2280 


AAGAGAGTGG 


CACTGGGCGT 


ACAGCTCAAT 


AATCTCTTCA 


ACCTCATGTA 


TGCGTCCAAT 


2340 


GCCTACATCT 


ACGATGCCGG 


TTACGTACAG 


GCATCCGGAG 


AACTAAGTGC 


AT AT GC C GAT 


2400 


CTGCGTTATT 


ATCCTCAGGC 


CGGATTTAAT 


GCACTGGGTA 


NT CT GACAAT 


CGATTTC 


2457 



25 (2) INFORMATION FOR SEQ ID NO: 8 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 957 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
35 (iii) HYPOTHETICAL: NO 



27 



(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

5 

(ix) FEATURE : 

(A) NAME /KEY: misc_f eature 
{ B) LOCATION 1. . . 957 

0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 



15 



20 



25 



TTTACGTCCG 


G TAC GAT ATT 


CGTCCGTATA 


GCCATATCCG 


GCCGTGTAGT 


GTGCCGTGAG 


60 


GTTGAGGATA 


AC GGAAGGAG 


AGAAGCTGTG 


C GT CAT GAT G 


GCATGGTAGT 


GACGCTGCTC 


120 


GTAATTGTCG 


GTATTGTGGT 


AGTAGTGCGG 


TACTCCTTGC 


GCGTCCACGT 


AC AT AAGAC C 


180 


GGCACTGTTG 


TATCGGCGGC 


CAT ATT T G GC 


TTCATCCTCC 


TTGGAAAGAC 


CGTTCCATGC 


240 


GATACCCGTA 


ACTTCTTTTC 


CTCCGAAAGT 


GATGAACCTG 


AGAGCCGTGT 


TGCTACCGAA 


300 


ATAGCCCACC 


TGTGCGAAAT 


AGGATT T CAG 


ATCCACGCTT 


CCTCTATCCA 


CGTAGCCGTC 


360 


CGAACCGATT 


TTGGACAGGC 


GGGCATCCAC 


TGCCCAATGG 


CGACCGATGC 


GTCCGCTACC 


420 


GAGTTTGACC 


GATCGGCGGA 


ATGTGCCGAA 


CGAACCTCCG 


CTCAAATCGA 


CACGGCCATA 


480 


AGGAGCCAGT 


C C CAAATT AT 


CCGTACGCAT 


ATT GACACT T 


GCCCCAAAAG 


CTCCGGCACC 


540 


ATTGGTGGAA 


GTACCCACAC 


CTCGCTGCAC 


CTGAAGGTCT 


TCGATGGAAG 


AGGCGAAGTC 


600 


GGGCATATTC 


ACCCAAAAGA 


CGGACTGAGA 


TTCGGAGTCG 


TTGAGGGGTA 


CTCCATTGGT 


660 


AGTTATGTTG 


ATGCGATTGG 


CATCGGTGCC 


ACGCACGCGA 


AAGCCGGAAT 


AT C C GAT AC C 


720 


CGTACCGGCA 


TCGCTGGTGG 


CTACCACGGA 


GGGAGTCAGC 


ATCAGCAGAT. AGGGGATGTC 


780 


AC G AC C AT AA 


TTGGACTTGG 


AAAGTTCGGC 


CTTGCGAACG 


TTGGTGTAAG 


CGACAGGGGT 


840 


TTTCGCCGTG 


GCGCGAGTAG 


CTACGACCTG 


TACGGTCTGG 


AGCT GCACAT 


TGCTAAGACT 


900 


ATCTATCTCG 


CTGTTGGAGA 


CGGGTGCTTG 


TGCCGTCAGG 


CAGAAAGGCA 


GGACGGC 


957 



(2) INFORMATION FOR SEQ ID NO : 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



28 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

5 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

10 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1. . . 729 

15 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9 



20 



25 



TCGACAATAA 


TGAAAAAAGC 


TATTCTTTCC 


GGAGCGGCCT 


TGCTCCTCGG 


CCTATGTGCC 


60 


AAC GCACAAA 


ACGTGCAGTT 


G C AC T AC GAT 


TTCGGTCATT 


CCAT CTACGA 


CGAACTAGAT 


120 


GGACGTCCCA 


AACTGACTAC 


CACAGTGGAA 


AACTTCACAC 


CCGACAAATG 


GGGAAGCACC 


180 


TTCTTCTTCA 


TCGACATGGA 


TTACACGGGC 


AAGGGTATCC 


AGTCGGCCTA 


TTGGGAGATT 


240 


TCGCGCGAAC 


TGAAGTTTTG 


GCAAGCTCCC 


GTTTCCATTC 


ATTT GGAGTA 


CAACGGAGGC 


300 


CTCTCCACAA 


GCTTTACTTT 


C GGACAC GAT 


GCTCTAATCG 


GTGCCACCTA 


CACCTACAAC 


360 


AACCCCTCCT 


TTACACGTGG 


AT T T AC GAT C 


ACGCCCATGT 


ACAAGCATCT 


GGGTGCGCAC 


420 


GACTTCCACA 


CCTATCAGAT 


CACCGGCACT 


TGGTACATGC 


ACTTTCTGGA- CGGTCTGCTT 


480 


ACCTTCAACG 


GCTTCCTCGA 


TCTTTGGGGT 


TTCCCCCAAG 


AGAACCCAAT 


CGGGGGCCCT 


540 


GTGCTCAAAG 


AAGGGGATAA 


GTTCGTATTC 


CTGTCCGAAC 


CGCAGTTCTG 


GATCAACCTC 


600 


AATCGCATCA 


AAGGCATCGA 


CAAGGATTTC 


AATCTCAGCA 


TAGGGACAGA 


GATGGAAATC 


660 


AGCAGGAACT 


TCGCTCGCAT 


GGACAAATTC 


TCCTGCATCC 


CTACTCTTGC 


GGTCAAATGG 


720 


ACTTTCAAC 












729 



(2) INFORMATION FOR SEQ ID NO: 10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1350 base pairs 

(B) TYPE: nucleic acid 



29 



(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

5 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

3 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME /KEY : mis cofeature 
> (B) LOCATION 1. . . 1350 



20 



25 



30 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 

TGGTGCCAAT CCGACCCAAT ACCTCATTGC CAT G CAGTAT ATCGAAACGC T CAAGGACAT 
CAACAAGGGC GATCAGACCA AGACCGTTTA CCTCCCGTTC GAGGCTACCG GTATGCTCGG 
TGCCTTGGGC GGTATGAAGG ATTTGGTGAA AGGAT GAGAC TCTCTGCCAT TCTTATCGCT 
TTGATTGTGA TGCTGCCTGC TGTGCTTAGC GGGCAGCATT ATTATTCCAT GGCGGGAGAG 
CGACTGGAGA C GGACAG CAT TCGTCCGAAC GAACTCTCGG CATCGATCCG AAGTGCGCTT 
TTCTTTCGGA ACAATGAATA CAATGCACGT TCGGTCAAAG GTTATACGTT. GCCGGGTGCA 
CGGGTTTCCG CTTTTGCCTC TTACTCGCTG CCGGCAGCAC ATGGTGTGAA GCTTTCGCTC 
GGAGTATCTA CCCTGAACTA CTGGGGGGCA AGTCGCTATC CGGCCGGTAT CGCTTATTCC 
GATTTACCTT ATTGGACGGA CTATAACGAC TATGTACGCT TGCGTATCCT GCCTTATGTA 
CAGGCCATGC TGAAGCCGAC GGCCACGACT GCTCTCATGC TGGGCAATAT AGCCGGTGGT 
ACGGCTCACG GACTGATCGA AC C GAT C T AC AATCCTGAGT TGGATTTGAC GGCTGATCCT 
GAAGCCGGTG TGCAATTTCG GGGTGATTGG ACACGTTTCC GAATGGATGT TTGGGTCAAT 
T G GAT GAG C A TGATTTTCAA AAATGACAAT CATCAGGAGT CGTTTGTCTT TGGCTTGTCC 
ACTACTTCGA AATTGTTATC GGGTGAAGGC AAATGGCGAC TCGAACTGCC CTTGCAGGCT 
ATTGCCACGC ATCGCGGCGG G GAATACAAC TGGGCGCAGC AGGAT AC C GT G CAT ACAT GG 
GTCAATGGAG CTGTCGGACT TAAGCTTTCG TATCGCCCTC GTACCGACAA AC C CAT G CAG 
ATTTGGGGAT CTGCTTATGG TGTGGCAGCC TTGTCAAGCG GAG GAT ACT T CCCTTACGAA 
AGAGGGTGGG GCGGTTATCT TTCTCTCGGA ATGGACTTGG AGCACTTCGC TTTTCGTACC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



30 



GACTATTGGT ACGGCAGGCA TTACGTTTCT CCCTTTGCTG CACCTTTCGC CAATTCCCTG 
AC GT AT GAC A AACAGCCTCT TACGAACGGT TGGGGCGATT ATATTCGTCT CTATGCCGAC 
TATTCGTGGC GGATGGCACG AAGTGTTTCG TTGGCGGCTG TTGCTCGGGT ATGGTTCCAG 
CCTTCGGATC GTTTTGCGAT GAGCCACGCC TTGGAACTGA CGATGCGTAT CGATCCCAAA 
T T C C C AAT AG CTTTTCTGAA AGGCAATCAT 

(2) INFORMATION FOR SEQ ID NO: 11 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1341 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1. . . 1341 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 

CCGTTTTCAC CGCCATGGGT ATGTGCTGGA GCGGCTTCCT CAGTAC GC AC ACGGCCATGC 
TCGACAGTCT CAAGTACCGC CGCCTCATCA C ACAGGC GAT C GGAGC AC AT GCGATCGGCG 
GACTCGTCGC CGGCATATTC GCCCATTGGC TCTTTGTCCT CATCTCACTC ATTTGATTCG 
ATACCCGACC AACGTATGAA CAAATCGCTA TTATCATTGG CATGCCTCAT CCTGTGCGGT 
ATGCCGGCCA TCGCCCAACA GACAGGACCG GCCGAACGCA GCGGCGAGCC TTCTCTGGCC 
GAACGTGTAT TCGGTCTGGA GCAGAAGCAG AAAAAGCTGA AGGTGTACTT AGGCATACAG 



31 



10 



TCGTTCTACG ACCAGCCGCT TGTCGATGAC G AAT C C CAT A TCGGACACTT CAAGGTACAG 
GAGCTGCGGA TGTCTGCTCA TGGCGAACTG AACCGCCACC TCAGCTTCGA CTGGCGACAA 
CGTCTCAACC GTGCCGCCGA CGGCACTTCG TTTGCCGACA ATCTCTCCAA TGCCATCGAC 
ATCGCAGGTG TGGACTGGCA CCCGAACGAC AAGGTGTCTT TCTTCTTCGG ACGTCAGTAC 
GCGCGTTTCG GAGGGATAGA ATAC GACAT G AACCCCGTAG AGATCTACCA GTACAGCGAC 
CTTGTGGATT ACATGACCTG CTATACTTCG GGCGTGAACT TCGCATGGAA CTTCCACCCC 
GAACAGCAGC TGCAGCTACA GGTACT CAAT GCTTACAACA ACCGCTTCGC CGACCGCTAC 
CACGTGACAC CCGATGTCGC TACCGCCACG AGCTACCCGC TCCTCTACTC GGCACAGTGG 
AACGGTACCC TCCTCGGAGG AGCACTGCAT ATGCGTTACG CCGTGTCGAT GGCTCATCAG 
GCCCAAGAGC GTAAT AT GT G GTACTTCACT GCGGGCAACC TGTTCAATCC GGGCAAACGG 
ATCAACGGAT ACCTCGACCT CACCTACTCG ATCGAGGGAT TGGACGACAA AGGCATTATG 
ACTGCTCGCT ACGGCAAGGG CAAGACCCTC ACGGACGTCA AGTAC TAT GC TCTGGTATCG 
AAGTGGAACT TCCGCATTTT CGATCAGGTC AATCTCTTCC T CAAAGG CAT GTAC GAGAAC 
GGCTATGCGC CTGCCCAATA CGGCGAGAGC AGCCACACGC GCCACTCCTA C GG C TAT AT G 
15 GGAGGGGTGG AATATTACCC T AC G GAG AC C AACTTCCGTC TGTTCGTCAC CTACATAGGA 
CGGCATTACC GGTACAGTGC GACCGAGACG GAAAGCACCA ATGCTCTTCG CGCCGGTCTG 
AT C TAT C AGA TACCTTTCTT A 



20 (2) INFORMATION FOR SEQ ID NO: 12 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1596 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
30 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1341 



35 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



30 



32 



(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1. . . 1596 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 



180 
240 
300 
360 
420 



TACAGGCGAT CGGGAAGGGT TTGTCCTCGT ATCCTCCGGA ACAAACGTTC CTACGCAGAT 60 

ACTCGGTTAC AG C C GAGAAG AGCGGTTCGA CTACGAGCCG GCACCAGAAC AAAGAT GAAA 12 0 
AGAAGGTTTC TATCGCTGTT ACTGCTGTAC ATACTCTCTT CCATCAGCCT TTCTGCTCAG 
10 CGGTTTCCGA TGGTGCAGGG AATCGAGTTG GATACCGATT CGCTTTTCTC TCTGCCCAAG 
CGTCCTTGGC GCGCCATCGG TAAAAC GAT A GGCGTCAATC TGGCCGTATG GGGCTTCGAT 
CAT T T CAT C A TGAACGAGGA CTTTGCAGAC ATCAGTTGGC AGACTAT CAA GAG CAATT T C 
CAAACAGGCT TTGGCTGGGA CAATGACAAG TTTGTCACCA ACCTCTTCGC ACATCCTTAT 

CACGGATCGC TCTATTTCAA TGCAGCGAGG TCGAACGGTT TGAGCTTCAG GCACTCTGCT 480 

15 CCGTTTGCCT TCTTTGGCAG TCTCATGTGG GAGCTGCTTA TGGAAAACGA GCCACCGAGT 54 0 

ATCAACGACC TCTGTGCCAC CACCATAGGC GGTATAGCTT TGGGGGAGAT GGGGCACAGG 600 

CTGTCGGACC TGCTCATCGA CAATCGTACC ACAGGGTGGG AACGTATGGG GCGCGAGGTG 660 

GCTATCGCTC T GAT CAATCC GATGCGCTTT CTCAACCGTC TGACAGCAGG AGAGGT GACT 72 0 

TCTGTCGGGA GTCGCAGCGG ACAGATATTT CAGTCTGTCC C C AT AAAC AT AGTCGTCGAT 780 

20 GCCGGCTTTC GCTTTTTGGC AGACAAGCGG CATGCCCGAA CCGGTGCCAC GGCTCTGACC 84 0 

CTGAATCTGA GATTCGACTA CGGCGATCCA TTCCGAAGCG AGACTTTCTC T C CAT AC GAT 900 

TTCTTCCAAT TCAAAGCCGG ATTGAGTTTC TCCGAATCGC AACCTCTGCT GAGCCAGATC 960 

AATCTGATCG GAATCCTAAG CGGATGCCAA CTGCTCGCAC ACGAACGAAC GGTTTTGGTG 102 0 

GGAGGTCTCT TTCAGCACTT CGACTACTAC AATTCGGAAA AACGAATAAGL CAAAAATTCG 108 0 

25 GAG GAGGTAC TCGTCACCCC ATACCGTATC TCGCAAGTGG CAGCTCTGGG AGGCGGTCTT 114 0 

ATCTTCCAGC AC CAC GGAAA ATTTCGACGA CGTCCTCTGG AGCTATATGC CGAGACCTAC 12 00 

CTGAATGTCG TCCCGATGGG AGCCAGTCTG TCGGATCACT ACAACGT GGA CAATCGGGAC 12 60 

TATAACCTCG GCAGCGGATT GAGCGGCAAG CTATACCTTG GTGCTACGTA CAATGATCTG 132 0 

TGGAGCTGGC TCTTGGGAGT CGAAAGCTAT CGGCTCTACA CATGGATCGG GT AT GAAGAG 138 0 

CCGCACCAGA AAAAT AC C G A TGTCAGCTCT TTTATGGTGC AGGGGGACGA AAGCAAGGCG 14 4 0 

CGCCTACTGG TGACGAGTTC CGAGTTCGCA TTTCATCCTG GCCCCTGGCA T GTAGC CAT C 1500 

GTCGCTCGCC GTTTCATCCG CAAAACAGCC TAT CAATT CT ACCCTAACGT ATCATTCGAT 1560 
ACCGGCGACA TACAGCTGCG TGTCGGATTT CACTTC 



1596 



35 



(2) INFORMATION FOR SEQ ID NO: 13 



33 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 



15 



20 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival!* 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...900 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 



AAAAGAAGAA AAAAACAGAT GAAACGACTG ATTGTTTTTC TGGCAATGGG TGGCTTGCTG 
TTCACCCTTG CGAACGCACA AGAAGCAAAC ACTGCATCTG ACACTCCCAA- AAAGGACTGG 120 
25 ACTATAAAAG GTGTGACCGG ACTAAATGCC TCTCAGACTT CTCTGACCAA CTGGGCTGCC 



GGTGGAGAAA ACACGGTGGC AGGTAACCTC TATTTGAACA TAGATGCCAA CTACCTGAAA 
GATAAATGGA GTT GGGACAA CGGTTTGCGT ACAGACTTCG GTCTGACCTA CACAACAGCC 
AACAAGTGGA ACAAAAGT GT AGACAAGATC GAACTCTTCA CGAAGGCCGG C TAT GAGAT C 



TATCTCACTC TCGGTATTGG TGCGGACTAT AAGCCCAATG AGAAGTTCTC TCTCTACCTC 
TCTCCTACAA CGGGCAAGCT GACTGTAGTA GCAGACGACT ACCTCTCAAG TTTGGGAGCC 

TTCGGGGTGA AAGTTGGTGA AAAGAC AAT G TTCGAACTTG GTGCTTTGGT AGTGGGTTCG 

cznr a 7\m Am 7i * * m ^ 



AACAAGTTCC TCACGGCTAC GATAGCCACC AATCTTATCT ACGACGATGA TGTGAAGATC 



180 
240 
300 
360 



GGCAAACATT GGTACGGAAG TGCGCTTTTC ACTTTCCTCT CACAGTATGC CAAAGGATAT 4Z0 
GAGAAGCCCT CGGATCACTT GACAGGAGTC AAGCATATCT CTAATTTCTT CGCTCCTGCA 



480 
540 
600 
660 



GCCAATATAA ATCTGATGGA GAATGTCAAT TTGATAACCA AGGCTTCATT CTTCTCGGCT 720 
35 TATACGCACG ACTTTGGCAA CATTGACATC AATTGGGAGG CTATGCTGGC CATGAAGATC 



780 
840 



34 



AAC GAT GG C C CGAAAATCCA GTTCAAAGAA GTTGTGGGCG TGGGTGTTGC GTACACTTTC 
900 

(2) INFORMATION FOR SEQ ID NO: 14 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2919 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1...2919 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 

TTTTACACCG GAATT GACTT GCATATAGAA AGCCAAATGA AGAAACTTTT C C CAT TATT A 
TT ACT CAT AT TGTCGATTTT GGTCGGATGT GGAAAAAAAG AAAAACACTC TGTAACTGAA 
ATCGCCNGAG AGAAAAAGCG TAT TACT G CA TTGCTGTACG AAAAAGAACT CCCCACTGAT 
TCTGTTAAAC AGCTTTACGA GAACAGCGTA CAGAACAAGA ACCTTGTGGG ACAAAT GTT A 
TTCGCGATCG AGGTCGGCAA ACGGATGCGT AATATGTCCC AATACACAGA TGCGATGCTA 
TAT C AC C AAA AAGGGTTGAA CGCTGCATTG AATCTAAGGG AC AC CAT C GT AGCCGCACAA 
GCATGGAATC ATCTCGGAAC GGATAGCCGT CGTATCGGTG CTTTGGCAGA AGCTTCGGAT 
TATCATTACA AGGCTCTTTC CTT GATAGAA TCTTTTAGCG G AAAC C AG AA TAGGCCTGCG 
ATCAAAGCCA GATCGGCGGC CCTGAACGGC AT CG GCAACA TCAATCTTGA GTTAGGATAC 



35 





CATGATGAGG 


CCGAAAAGAA 


TTTCCTGAAA 


GCACTGCAAG 


GT GAGAAAGA 


ACTCGACAGT 


600 




CCTCTCGGGC 


AAGCT AT CAA 


CTAT GCGAAC 


CT CGGACGT A 


T CTAT C G AC A 


ACGCAAAGAA 


660 




TACGACAAGG 


CTCGTACCTA 


CTTCCTCTTG 


TCTCTGGAGC 


AGAACAATAT 


GGCAGAGAAT 


720 




CTGATGGGTA 


TCGGACTCTG 


TAGCATCAAT 


CTCGGAGAAG 


TAGACGAAGA 


AAAAG GGGAT 


780 


5 


TAT C AAAAGG 


CTTTGCAAGA 


GTATGCCACG 


GCATACAAAC 


T GAT GGAACA 


GTTGTCCGAT 


840 




C GAT GGC ACT 


GGCTGAATTC 


CTGTATCCCG 


ATGGCACGTA 


TCAATCTCAA 


ACAAGGTAAC 


900 




GAAAGGCTCT 


ACCAGCATTT 


CATTTCTTTG 


GCCGAAGGGA 


CTGCGAAAGA 


AATTAATTCG 


960 




ACTTCACATC 


TGATAGAAAT 


ATACAATCTT 


C AATAC GAGA 


ATCTCGAGCG 


TAAAAAAGAA 


1020 




TACAAACAAG 


CCCTCGAAGC 


ATTCTGTCTG 


AGCAAGACGT 


TGAGCGACAG 


CATGTCCATT 


1080 


10 


GCGCACAAGG 


T C AG C AG CAT 


ACAAGAAACG 


CGATTCAACT 


AC G AAC G AAA 


CAAGTCCCAA 


1140 




AAAGAGCTTG 


AAGAAATACA 


GCAAGTAAGC 


AAGGCAAAAC 


AAGAGAAAT C 


GAAGTTTATC 


1200 




CTCTTGAGCA 


CTCTTTTTGC 


CCTTTTCATC 


TCGATTCTTT 


TGATTTCTGT 


TCTGACATAT 


1260 




G CAT AC C GT C 


AGGGCAAGAA 


GCATAACAAG 


CTGATCAAAG 


AGACGGATAA 


ACTTCGCTCC 


1320 




GGCTTTTTCA 


CCGGTATTAC 


ACAC GAATTT 


CGTACGCCTA 


TCACCGTCAT 


ACAAGGTTTG 


1380 


15 


AATGAGAAAA 


TGAGTTCAAG 


TCCTGATCTC 


CAAGCATCGG 


AC AG AAC C G A 


GCT GCACAAG 


1440 




ATAATAGACA 


GACAGAGTAG 


CCATATGCTG 


AATTTGGTGA 


ACCAGCTGTT 


GGATATTTGC 


1500 




AAGAT CAGAA 


GCGGAGTATC 


CACGCCCGAA 


TGGCGCAATG 


GCGACATCGT 


CTCCTTCGTA 


1560 




CAGATTCTCA 


TCGATTCGTT 


TGCACCATAC 


GCACAGGCTC 


AAGACATAAC 


CTTGGAGCTA 


1620 




CAACCCGAGA 


GCAAACCTAT 


TGTCGTGGAC 


TTCGTCCCCT 


CCTACTTGCA 


AAAAAT CAT A 


1680 


20 


TCCAATCTTT 


TGTCCAATGC 


CATCAAGTAT 


TCTTTAGCCG 


GAGGGAGAGT 


GGTCATATCT 


1740 




CT GGCAAAAA 


CCAAGAATGA 


AAAAAATCTG 


AT CAT AC GC G 


TTGCAGACAA 


TGGCATAGGA 


1800 




ATAGATAAAA 


CT GAT CAGG C 


TCATATCTTC 


GACATCTTCT 


AT C GAGGACA 


GTCCGCTACC 


1860 




GAAAAGCATG 


GATCAGGCGT 


CGGACTCTCG 


TTTAC CAATA 


TACTGGTCGA 


AAACCTTCGA 


1920 




GGTACGATCA 


AAGTGGAAAG 


CCAGCCGGGG 


AAAGGAAGTG 


CCTTCACCAT, 


CAGTATTCCT 


1980 


25 


ACACAAAACC 


AGTCCTCTTC 


GGCAGAGATT 


CTTCCTTGGC 


TACCCTCCTC 


CGATGACATT 


2040 




GTCATGCCTG 


TCCACATCGC 


GCCCGATGAC 


TCACCGACAT 


CTCCGATGGT 


AGCAGCTCTG 


2100 




AATCATCGCT 


TCGAGGACGA 


ACGTCCGACC 


ATACTGCTCG 


TCGAGGACAA 


TAAGGATATC 


2160 




AACCTGCTCG 


TCAAACTACT 


CCTTTGCGAT 


CGCTACAATG 


TGCTATCCGC 


CGCAAACGGA 


2220 




AAAGAGGGTA 


TAGCCCTCGC 


TACCGAGCAT 


ATTCCCGACA 


TTAT CATTAC 


GGATATTATG 


2280 


30 


AT GCCGATAA 


TGGATGGGAT 


AGAAATGACA 


ATCCGGATGA 


AGCAATCGCC 


TCTGCTCTGT 


2340 




CACATTCCCA 


TTGTCGCTTT 


GACGGCCAAG 


AGTACCGAAC 


AGGACAGATT 


GGAAGGAATC 


2400 




AAAAGCGGTG 


TAGTCTCTTA 


TCTATGCAAG 


CCATTCTCTC 


CGGAGGAGCT 


TTTGATGCGG 


2460 




ATCGAGCAGC 


TTCTGAAAGA 


CCGTGAGTTG 


CTCAAGAAGT 


TCTATATGCA 


AAAACTCATG 


2520 




CTGGATCGGA 


AGCCGGAGGA 


GGAGCCTCAA 


CCGATAGATG 


ACAGCAGTAT 


GCAGTTTCTC 


2580 


35 


CTTGCTGCCA 


AAGATGCAGT 


GTCCGGTGGA 


AT CAAAC AAA 


ATCCGGATTT 


TTCCGCTCAA 


2640 




GACTTGGCCG 


AAAAAATGTG 


CATGAGTCCA 


TCCCAACTCA 


ACAGAAAGCT 


CACGAGTGTC 


2700 



36 



5 



GTAGGTTGCT CCACCATCGG C T AC AT ACAG CAGATCAAGA TAAAATTGGC CTGCAAGCTC 27 60 

CTTGCCGATG AG AG CAAAAA CATCTCCGAC AT TAGCATT G AGGCAGGCTT TTCGGATCCG 2 82 0 

GCTTACTTCT CTCGCACCTT CAAACGCTAC ATGAACTGCT CTCCCTCCCA AT AT C G GCAA 2880 

AAACTCCTTG CCATGCCGGG GAGC GACAAG GAGACAGTT 2 919 



(2) INFORMATION FOR SEQ ID NO: 15 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 681 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

15 (ii) MOLECULE TYPE: DNA (genomic) 

( iii ) HYPOTHETICAL : NO 
(iv) ANTI-SENSE: NO 

20 

(vi) ORIGINAL SOURCE: 

{A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 
25 (A) NAME /KEY : misc_f eature 

(B) LOCATION 1 . . . 681 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 

30 TATAAGCAAG CTATAATGAA ACATTTGTTT AAGTCGACAT TAGTACTTCT TTGTGCTCTT 60 

TCTTTTTCCG GTACCTATAC CTTTGCACAA GAAAATAATA CAGAAAAGTC ACGATTTGAT 12 0 

TTTTCTGTTA GGCTGGGACA GGGATATATT GCAGGTTCAA CTACCAACCT GATGTATGGG 180 

TAT AC AT C T G CTAAC GAT AG ACTTTTGTCT GGTGCAATTT ATCTGGGCTT GACACCAAGT 24 0 

AAGAAAGAAA ATGCAACCGG CGTAGCATTT CGTTTCTTAT CNCCCTCTCC GGGTTATTAT 300 

35 GT C GAT AT AT CCGGCAAAGA AAATACCTTG AATTATGCGT TTTACGTTGT C GGAGC AT AT 360 

AATAGAATAG CCATTCCTAT ACGCCCTATC AAAAATTTTA ATTTCATCTT CTCTACAGAA 42 0 



37 



GTCGGAATGG CTTGGATGAG TCGTCATGAG CAAATTTACA ATTCTACTTC GCAGACTTGG 
GATAAGCAGC GCAAGTCGAG GTCGGGACTG GATTTTGGTC TCGGGATGCA TCTGCAATNC 
CACATTAATA AGACCGTTTA CTTTATGGCA GGAACCGATC TTACGTCTTG CATGTTCGGA 
AAAAGGATCA AT GACTAC CA GCAAAAGGAT CGAACCTTCA TTGCACTTAT CGACAACAGT 
5 ATTGGCATAG GATTAAACCT C 



(2) INFORMATION FOR SEQ ID NO: 16 



480 
540 
600 
660 
681 



10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 705 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

15 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 



20 (iv) ANTI-SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivals 

25 (ix) FEATURE: 

(A) NAME/KEY: mis cofeature 

(B) LOCATION 1. . .705 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 

30 



35 



AT GAAAAC AA 


TTAGTAAGAA 


CCATGCGGCA 


CGGATCTGTG 


CCGCCATTGC 


TTTGTTTGCA 


60 


GTGTGTAACG 


GCCGGATAGC 


TGCTCAGGAT 


TTTCTCTATG 


AAATAGGAGG 


AGGTTTTGGT 


120 


GCTGCTCAGT 


ATTTTGGCGA 


TGCAAACAGA 


GGCTTGTTCG 


GTTCATCCGG 


AGTAGGTTTG 


180 


GAGTTGGTCG 


GACGTTACAA 


TTATAATTTT 


CGCTGGGCTT 


T CAGTAC CAT 


GTTGGATTGG 


240 


CGTACATTGA 


GAGGCGATAC 


CGATAAGTCC 


GGGAATGTCT 


TCCCCGATTT 


TGCTCAAGCG 


300 


GATTTTAAGG 


TCGGCTTGAC 


TCAGCTCCAC 


GTTAGAAGCG 


AATTTAACTT 


TCTCCCTTAT 


360 



10 



20 



25 



38 



AGCGATGGCT ATAAGTATCT TGGTACAGCT CGGCTGTCTC CTTAT GTAGC GGCCGGGTTG 420 

TCTTTGGGTT TTGCTTCGGG TGCTAAAGGT TCGGCTTTTG CTCCCGGGAT TACTGCGGGA 4 80 

ATGGGAGTGA AGTATAAGCT TAAACCGCGG ATCAATGTCG GTATCGAGTA TTCTTTCACG 54 0 

GGGTTACTTA CCGATGCTTT GGATGCGCTG AC GGATAAAA GTGTTTGGCT C GAG GAT C C A 600 

T AT AAGAT C A ATGACTCCTG GGTCAAAAAC AAGGATGCTA CAGGGGCCTT AGTGCTTAGG 660 

AT T AC GT AT G ACTTCGGCCT GCGTAAGACT TTTTGTAATA AACAA 7 05 



(2) INFORMATION FOR SEQ ID NO: 17 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3486 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
15 (D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 



( iii ) HYPOTHETICAL : NO 



(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1. . .3486 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 

GCTATTTCTC AG AT G AAAC G AAT ACT T CCA ATAGTCGCAT TCCTTTCTCT CTTCCTTGCC 60 

CTTGCTTTGC CTGCGAAAGC GCAACGAGCT AT GGGGAAGA CGGCCGACCG TTCGCTAATG 12 0 

GCTTCGGGAC ATTGGGTCAA GAT AC GT GT C GATGCAAGTG GAGTGTATCG CCTTACGGAC 18 0 

35 GAACAGCTCC GTGCCAATGG CTTCTCCGAT CCGTCCAAGG TAGGTGTGTT CGGTTATGGT 240 

GGAGGGGTGC TTCCCGAAGA TCTGAGCCGG AT C AC G AC AG ACGATTTGCC TCCGGTACCG 300 



4 



39 



GTACTCCGTC AGGGCAATGC GCTGTATTTC TATGCCGTGG GCCCGGTGAC ATGGTTCTAC 360 

AATCCGGCCA AAACCACCAT GGAG CAT AC G GTGAATACAT ACAGTACGCA TGGCTACTAC 420 

TTCCTGTCGG ATGCTGCCGG AGCACCTTTG CAGATGTCCC AATATACGGG TGGAGGTGCG 48 0 

TCGGCCGAGG CTTTGATCGA CTACTACGAT GAGCTGATGC TCCATGAACA GGAATTGTAT 54 0 

5 TCGCCCAAAG AATCGGGACG AGATCTGTAT GGCGAGTCTT TCAGTGCAGT CAATACGCGT 600 

ACGGTCAAGT TCCCTTTGAG GGGCAACACC CGCTCGTCTG GCGAACTCGG TACCGTATTC 660 

T CAT AC AT AG CCAAGGCCAG ATCGGCCGGT GGCGGCCGTG AGATGTCGCT CTCGGCGAAT 720 

GGCATTCTGA TCTTCAGCGA TCCTTTTTCC ATGACATCGA ATGAAGTCTC CAATTCCTAT 7 80 

TTGGCCGGCA AGAAGCGTCG TCTCTATCAC AGTACGCCGA TGAACAGCTT GGTCAATGAG 84 0 

10 TTGCGCTTGG ACGCGAACTA TAGCATGACA GGAGATGCGG TCAATCTGGA T TT CAT AGAG 9 00 

GTGGCTACAC AGAACGACCT CCGGTACGAT GGCGCACCCA TGCATATCAG GCGGTTTTCC 960 

AATTTGCCCG TTTTGGGGGG CGAGTCCTGC CGGTTCGTTA TCAGTGAGGT GCCGGAGTCT 1020 

CTGGTGGTTT TGCAGGCCAA TTCTTCCCTG ACAGCATCGC TTGTTCCCGT TAAGACTGTC 1080 

GGGGATAAGA CCATTGAGTT CGTGGCTCCG CCGAAGGGTC AGGATCGTAG GACTAT CAAT 114 0 

15 ACGTTTTATG CCGTGGACTT GTCACAGGCT TCTGCTCCGG AGATCCTCGG AGCGGTACCC 1200 

AATCAAAACC TGCATGGAGA GGAAATCCCT GAT CT GAT C A TTGTCTCTAC TCAGGCGCTC 1260 

CTCCTTGAGG CTGATCGACT GGCCACCTAT CGTAGAGAGA AAAACGGGCT GAAGGTTTTG 132 0 

GTCGTGTTGC AGGAACAGGT GTTCAACGAG TTTTCGGGTG GAACTCCCGA T G CT AC AG CA 138 0 

TACCGCCTCT TTGCCAAAAT GTTCTACGAC AGAT GGAAGG CAAATGCACC TGTGGGAGAG 14 4 0 

20 ACCTTCCCGA TGCAAATGCT TCTCTTCGGT GATGGGGCTC AT G AC AAC AG GAAGGTCTCC 1500 

GTAGCTTGGC AGAAACCGTA TCTCCAACAA ACGGAGTTCT TGCTGACATT CCAAGCCGTC 1560 

AATTCGACGA AC GT AAACAG TTATGTGACG GAT GATTACT TCGGCTTGCT GGATGATCAG 1620 

CCGGCCTCGG TCAATATCGG TTGGCGCAAT TATAATATGG CTGTAGGGCG ATTCCCCGTA 168 0 

CGTACTCCGG CCGAAGCTCG CATCGCAGTG GACAAGACCA TCCGATATGA- GGAG GAT CGA 1740 

25 GAGAGTGGTG CCTGGCGTAT TCGTGCCTGT TTTGCGGCAG ACAAC GGGGA CAAGCACGCA 1800 

ACCGAGACTT CCCGTTTGAT CGATACCGTC AAGCGTTATG CTCCTGCCAT CATGCCGGTA 18 60 

CGCGCCTTTC AGGACGTATA TCCGCATGTC AT CGAGAAC G GGTTGCACAG CATTCCGGGT 1920 

GCAAAGAAAA AGATGCTGGA AACCCTTCAG TCGGGTATTA TCCTGCTTAA TTATGCTGGT 198 0 

CATGGCGGTC CTGCCGGATG GTCGGACGAG CATTTGCTGA CGCTCAACGA TATACACAAA 204 0 

30 TTCAATTATA AGCATAT GCC CATTTGGATT ACTGCCACGT GCGACTTTGC CAACTAT GAC 2100 

AGTCAGACGA CCTCGGCAGG GGAGGAGGTT TTCCTCCATG AGAAGAGTGG CACTCCGATC 2160 

ATGTTCTCGA CTACGCGTGT C GTTTA.C AAT ACGCAGAATG AGAAGAT CAA TGGTTTTATG 2220 

CTTCGGCGTA TGTTCGAGAA AGCTAAGGAT GGGCGTTATC GTACGATGGG CGAGATTATC 22 80 

CGATCGGCCA AACAGGGGAT GCTCAGTACT GTTTTCCCCG ATT C GAT CAA CCAGTTGAGT 234 0 

35 TTCTTTCTGA TGGGTGATCC GTCCGTGCGT ATGAATCTTC CTACCCACAA AGTGCAATTG 24 00 

ACCGCAATCA ACGGGCAGGA TCCCGAAGGG CAGTATGGAA CTATTATGCT CAAGTCTTTG 2 4 60 



40 



10 



15 



20 



30 



GAACGGGTAG 


CTCTGAAGGG 


TAAGGTAACC 


GATGAAAAGG 


GGACATTCGA 


C GAG AC AT T C 


2520 


AGTGGCAAGG 


TTTTCCTGAC 


CGTCTTCGAT 


GGCAGAAAGA 


AAAT GACAGC 


TTTGGAAGAG 


2580 


GAG G GAAAC G 


ATCTCTCTCT 


TGTATATTAT 


GACTATCCTA 


AC GT GAT GT A 


TGCCGGTATT 


2640 


GCCGAGGTGA 


AAGACGGACT 


CTTCGAAACT 


TCGTTTATCG 


T AC C CAAGGA 


TGTGAACTAT 


2700 


TCCGAGCACG 


AAGGCCGGAT 


CAATCTTTAT 


GCTTATAACG 


AGAGCACAAA 


GGCGGAAGCC 


2760 


ATGGGGGTAG 


ACTTCTCCAT 


CAGAGTCCAA 


CCGGGTATTC 


CT GAT GAG GT 


AACGGAAGAT 


2820 


AATACACCGC 


CT GAAAT CAT 


AAGCTGCTTC 


CTCAATGACA 


GTACATTCCG 


AT C GGGAGAT 


2880 


GAGGTTAATC 


CTACTCCTCT 


GTTTATGGCC 


GAAGTATTCG 


ACTTGAATGG 


AATCAATATC 


2940 


ACGGGTAGCG 


GAGTAGGGCA 


T GAT AT T AC G 


CTTTGTATCG 


ATGGCCGTGC 


CGACCTGACC 


3000 


TACAACCTCA 


AT G CAT ATT T 


CACAAGTTCG 


GCTACGGATG 


CAGGTGTGGG 


CACTATTCTC 


3060 


ttpattzatap 

1 1 \*-J-\ x vj.tA x 


V^wVjV— XXX 


Pf^A ACf^Af^AT 


PATAPTHPPP 

v^-xA X .tAv- J. vj v^ V_- 


GACTGACGGT 

vJ.TAV-' X V3/V^ vj vD X 


TTGGGAPATT 

x x v_j v_i vj/iv^ri x x 


312 0 


TTCAATAATG 


CCGTCCATCA 


TGACTTTTCA 


TTCAGAGTGG 


TAGAT GGC AT 


TGCTCCGGAT 


3180 


GTGGCTGATG 


TGATTCTATT 


CCCGAATCCG 


GT AC GC GAGA 


GTGCTACGTT 


CCGAATCTTC 


3240 


CACAATCGCC 


CCGGAAGCGA 


TTTGAACGTG 


GCCGTGGAGA 


TCTATGACTT 


CACCGGTCGT 


3300 


CTTGTGAACA 


GTTTGCCAGT 


CAAGACCTAT 


TCGTCTTCCT 


ACGGAGAACC 


T AT AGAGAT C 


3360 


AAGTGGGATC 


TGACCTCCAA 


ATACGGAGTG 


AAGATCGGAA 


ACGGATTCTA 


CCTCTATCGT 


3420 


TGTGTGGTGA 


ACTCTCCCGG 


AGGACAGACG 


GCCTCCATGG 


CCAAGAAAAT 


GAT C GT G GT A 


3480 


GGACAA 












3486 



(2) INFORMATION FOR SEQ ID NO: 18 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1218 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



41 



(ix) FEATURE: 

(A) NAME/ KEY : miscjeature 

(B) LOCATION 1. . . 1218 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 





AT ATT CAT AG 


ACCCCGATAA 


GAATACAAAA 


CAAAACGAAC 


GAAAT AT GAT 


TAT C AAG AAA 


60 




ATGCTGAAAA 


ATAAATTGGC 


CCCCTTGGCC 


ATACTGTTCC 


TTTTTGCTCC 


AAAGGCTATG 


120 


10 


AAGGCTCAGG 


AGCAACTGAA 


TGTGGTACAC 


ACCTCTGTGC 


CATCGCTGAA 


TATCAGTCCG 


180 




GATGCACGTG 


CGGCCGGTAT 


GGGGGATATA 


GGTGTGGCAA 


CGACGCCGGA 


TGCGTATTCA 


240 




CAGTATTGGA 


ATCCGAGTAA 


ATATGCTTTC 


ATGGATACGA 


AAGCCGGTAT 


TAGCTTCTCA 


300 




TATACACCCT 


GGCTGTCCAA 


GCTGGTCAAT 


GATATTGCCC 


TGAT GCAGAT 


GACCGGTTTC 


360 




TACAAATTGG 


GAACAGACGA 


GAATCAGGCT 


ATTAGTGCTT 


CTCTGCGTTA 


TTTCACATTA 


420 


15 


GGAAAGTTGG 


AGACTTTCGA 


CGAATTGGGC 


GAAT C CAT GG 


GAGAGGC CCA 


TCCCAATGAA 


480 




TTTGCTGTCG 


ATTTGGGCTA 


TAGCCGCCAG 


TTGTCGGAGA 


ACTTCTCCAT 


GGCTGTTGCA 


540 




CTGCGTTACA 


TCCGCTCAGA 


CCAAAGCACT 


CACAACACCG 


GAGAGAATCA 


GGCCGGAAAT 


600 




GCCTTTGCGG 


CGGATATAGC 


CGGTTATTTG 


C AGAAGTAT G 


TGCTACTGGG 


TAAT GCGGAG 


660 




AGCTTGTGGT 


CGTTGGGTTT 


CAACGTAAAG 


AATATCGGAA 


CGAAGATCTC 


CTATGACGGA 


720 


20 


GGTGTCACGA 


GTTTTTTCAT 


CCCTACTTCG 


TTGAATCTCG 


GGACGGGGCT 


GTTGTATCCG 


780 




AT C GAT GACT 


ATAACAGCAT 


CAATTTCAAC 


CTTGAACTTA 


GCAAGCTGCT 


TGTACCCACT 


840 




CCTCCTATCA 


TGGATCAAAA 


CGATCAGGCC 


GGGTATGAGG 


CTGCACTCAA 


GAAATATCAG 


900 




GAAACTTCTT 


CGATCAGCGG 


TATATTCTCT 


TCTTTCGGTG 


ATGCGCCGGG 


AGGACT CAAG 


960 




GAAGAATTCC 


GTGAGATTAC 


ATGGGGACTT 


GGGGCTGAAT 


ATAGCTAT GA • 


- CGATAAATTT 


1020 


25 


TTTGTTCGTG 


CCGGATATTC 


ATACCTGCAC 


CCCACCAAAG 


GCAATTTGCA 


GTACTTCACG 


1080 




GCCGGTGCCG 


GCTTCAAAAT 


GAACATATTC 


CGTATCGATG 


CTTCCTACCT 


GTTGTCTACG 


1140 




ATCCAGAGTA 


ATCCGTTGGA 


TCAGACTCTG 


CGGTTTACGC 


TTGCTTTCGA 


TATGGATGGA 


1200 




TTGCGCAATT 


TGTTCCAC 










1218 



30 

(2) INFORMATION FOR SEQ ID NO: 19 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 663 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 



42 



(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 



( iii ) HYPOTHETICAL : NO 
(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 
<B) LOCATION 1 . . . 663 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 



AC GAG AG AG A 


GTGTGTTACA 


TTGTAGAACA 


ATGATTTTGG 


CAGCTACTAT 


GCTGCTCGCA 


CCTGCTCTTA 


GACTGGATGC 


TAACTTTGTC 


ACGAGCGTGA 


ACAATAAGAT 


GATCGTAGGC 


CTTAGCAATG 


AT GGATT C T A 


TCTCGCCCCC 


ATGGAATCAC 


TAAGTGAAAC 


GACAACTCGC 


GGT AT GAGAT 


TTAGCTTTGC 


T G AC AAC AT G 


GCATAT GGT G 


TCGCCGGAAC 


GATTAAGACT 


GCCTTTGGTG 


ATAAC GGAT A 


TAACCGTTTC 


AGCTACGACC 


GTTATTACGT 


ACAAATTGGA 


GATGCTCCGG 


ATAAGACTTC 


TTTGCGTAAT 


TTC 







30 



AAACTCAAAA 


AAGAACGAAA 


AAT GAAG AAA 


60 


ACAATCGGTT 


TTGCAAATGC 


TCAGAGTCGT 


120 


GGTAGTAACT 


TAATGCAAAA 


AGT C GCAAAC 


180 


TTACGTGTTG 


GTGCTGCTGC 


TGAGTTCGCT 


240 


GGATTGGCCT 


ATACGATGAG 


AGGTGCTAAG 


300 


TTGCATTATC 


T GCAAATACC 


GGTGAATGCC 


360 


G CTATTT CAT 


TGGAAGCAGG-TCCCTATTTC 


420 


AAAGTTGCAG 


GCGTTACGGC 


TTCTGTAGAT 


480 


GACTTGGGCT 


TGGGCTTGTC 


TGCTGCCTTG 


540 


TAT GAGC AT G 


GATTGCTTAA 


TAT GTT GAAG 


600 


CATGACTTCT 


TTGTGGGTCT 


CGGTGTTCGC 


660 
663 



(2) INFORMATION FOR SEQ ID NO: 20 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 1401 base pairs 

(B) TYPE: nucleic acid 



43 



(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

5 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : mis c_f eature 
15 (B) LOCATION 1...1401 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 



20 



25 



30 



35 



AAAGGTACGT 


G G AAT AG AAA AAACCGAAGA 


. GAAGAAAT GA 


> AAC GGACAAT 


CCTCCTGACG 


60 


GCACTGACCG 


TCCTATCTTC 


GCTCTCCTTG 


CTTCGTGCAC 


AAAATGAATC 


CGAAGCATCA 


120 


ACCAATCCGA 


TGTCAGGCCT 


CTCCCTGGAA 


GACTGTATCC 


GGATAGC CAA 


GGAGCGCAAC 


180 


CTGAATCTGC 


GCAGACAGGA 


GATCGAACAA 


GAAAACCGAA 


TCATTAGTCT 


CGATGCAGCA 


240 


CGACACAGTT 


TCCTGCCCTC 


GGTCAATGCA 


GGCATCGGAC 


ACAACTATAG 


CTTCGGACGT 


300 


T C GAAAGAC A 


AAACGGGAGT 


AACCGTAGAT 


CGCTCCTCGA 


TGAATACCAA- 


- TCTCAGCATC 


360 


GGAGCTTCGG 


TGGAAGTATT 


CAGCGGCACA 


CGTCGTCTGC 


ACGACCTCAA 


GCAGCAAAAG 


420 


TACAACGTGG 


AGGAT GGT AT 


AGCCCGACTT 


CAAAAAGCGC 


GTGAAGACCT 


CAGCCTGCAA 


480 


ATCGCGGCTC 


T C TAT AT C AA 


TTTGCTCTTC 


CGTCAGGAAA 


TGACTCGTAC 


GGCAGAAACA 


540 


CAGTTGGCAC 


TGATTCGCGA 


GCAACGCAAT 


CGCACGGCCG 


AAATGGTTCG 


CGTAGGTAAA 


600 


TGGGCAGAGG 


GTAAGCTCCT 


CGACATAAAT 


GCCCAGATGG 


C CAAGGAC GA 


ACAACTTCTC 


660 


GTACAATATC 


GTTCGGAGGA 


GGAGCTGGCT 


CGTCTGGACT 


TGGGGCAAGC 


CCTCGAACTG 


720 


GAGCACCCCG 


AAAGCATTGC 


AGTCAAGGCT 


CCCGACACAG 


ACGTTCTCGT 


AG C AGAAAG G 


780 


TTGGGATCTC 


TCCTTGCTCC 


CGAAGAGATC 


TATCGCACGG 


CTCTCGGCTT 


GAAACCGGCA 


840 


CTGCATTCGA 


GCGAGCTGCA 


AATAGCTTCG 


GCACGCGAAG 


GTCTGGCCTC 


GGCTCGTGCG 


900 


GCATACTTCC 


CGACGCTCAG 


CCTCTCTGCC 


GGATACAGCA 


ACGGTTACTT 


CCGCGACCTC 


960 


GGCAAGGAGT , 


ATGCCGCCAT 


CAACCCCTCC 


TTCTCCGAAC 


AGTGGAAGAA 


CAACGGCAGC 


1020 


TACAGTATCG 


GACTCTCTTT 


G AAT AT C C C C ATCTTCTCTG 


C CAT GCAAAC 


GCAAGATCGC 


1080 



44 



GTTCGGAGCA 


GTCGCCTGCA 


AATACGCTCA 


AGCGAGCTTC 


GACTCGTCGA 


AGAGAAAAAA 


1140 


GCCCT CTATA 


AAGAGAT C AG 


G C AAG CAT AC 


AGCAATGCCG 


TGGCAGCCGA 


TAAGGCCATC 


1200 


GCAGCAGCCG 


AAAAC AG CAA 


GGCCGCTACG 


CTCAAGGCAT 


ACGAATACGC 


TCGCGACAGC 


1260 


TTCGAGGCAG 


GGCGCTTGTC 


TGCCTACGAA 


TATGCCGAGG 


CAAAAACAAA 


ATACGCCCTC 


1320 


AGCCAAGTGG 


AAGAACTTCG 


TGCCAAGTAT 


GACTT CAT AT 


ACAAAGC CAA 


AGTTTTGGAT 


1380 


TTCTATCAGG 


GCAAAGACTT 


C 








1401 



(2) INFORMATION FOR SEQ ID NO: 21 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1308 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
15 (D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 



20 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE : NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

25 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1...1308 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:21 

AATATAATGT ATAAAGACTA TAAGGGTTTG TATGCGTCGC TTCGGTGGTA TGCCCTGATC 60 

ATTGGGTTGC TATTTGCAGC AGAC GGT AT A CAGGCTCAGA ACAACAACTT TACCGAGTCG 12 0 

CCTTACACTC GCTTCGGCCT TGGCCGTCTC GGAGAAC GGA C GACT ATTAG TGGGCATTCC 180 

35 AT GG GAGGAC TCGGCGTCGG TCTGCGTCAG GGCACATACG TCAATGCCGT CAATCCTGCT 24 0 

TCATACTCGG CTGTGGATTC GATGACGTTT ATCTTCGATT TCGGTGCATC T AC C GGAATT 300 



45 



10 



15 



ACGTGGTATG CCGAGAACGG GAAAAAGGAC AATAGGAAAA TGGGAAACAT TGAGTATTTC 
GCCATGCTTT TTCCTATTTC CAAATCCATT GCTATGAGTG CGGGAGTGCT TCCTTACTCC 
GCATCCGGGT ACCAGTTCGG ATCCGTTGAT CAAGTGGAAG GAGGCAGCGT CCAGTACACC 
CGTAAATACT TGGGGACAGG CAATCTGAAC GAT CT C TAT G TCGGTATAGG TGCAACCCCG 
TTCAAAAACT TCTCAATAGG AGCCAATGCT TCATCCCTTT TTGGGCGATT CACACACAGC 
AGGCAGGTAA TCTTCTCCAC GGAGGCTCCT TACAATCCCG TACATCTCTC GACGCTGTAC 
TTGAAGGCTG CCAAGTTCGA CTTCGGTATG CAGT AT C AC C TTCTTCTCAA ATCAGATCGT 
TCGCTCGTTA TCGGTGCCGT CTATTCTCCG CGGGTGAAGA TGCATAGCGA GCTGACTCAG 
ATAAAGAAT C AGGTTCAGAA CGGTGTAGTA GTGGAGAGCG AAACCCAAGA AT AT AT C AAG 
GGAAT G GACT ATTATAC C C T GCCTCATACA TTGGGGATAG GTTTTTCTTA TGAAAAGAAA 
GATAAACTTC TCTTAGGAGC AGACGTCCAA TATAGTAAAT GGAAAGGCGA GAAATTTTAT 
AAATCCGATT GCAAATTCCA GGACAGAATA CGGGTATCTC TCGGCGGAGA GAT CAT AC C G 
G AT AT AAAT G CCGTTGGGAT GTGGCCTAAA GTTCGCTATC GCTTCGGTTT ACATGGTGAA 
AATTCTTACC TGAAAGTGCC GACTAAAGGC GGTGTATATC AAGGATAC CA TATCGTAGGT 
GCTGTATTCG GTATAGGAAT CCCGCTCAAT GACAGACGTT CGTTCGTAAA TGTCTCTCTT 
GAATATGACC GATTGATCCC GAAGGAGGGT AT GAT C AAAG AAAATGCTCT GAAATTGACC 
TTCGGCCTCA CGTTCAACGA GTCATGGTTT AAAAAGCTGA AACTGAAC 



20 (2) INFORMATION FOR SEQ ID NO: 22 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2835 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



30 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1308 .. 



35 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivals 
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(ix) FEATURE : 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1 . . .2835 



5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22 

AATACCATTT CGGAGAATGG G AAC GAT AG C ACCTATTTTT CTTTTCAAAC TTTTCTTATG 60 

CGATCGATTT AT CAAT TACT GTTGTCAATA CTCCTTGCTT CTCTTGGTTT CGTCGGGCTG 12 0 

GAAGCCCAAC AAGCCGGAGT AGCAGGTAGA GTATTGGACG AAGAAGGCAA CCCCATGATT 18 0 

10 CAAGCCAACG TACAGCTTGT AC AGAGT AC C GGCCAAGTAG CCGTTGCCGC AGGTGCCACT 240 

AATGAAAAAG GGTTGTTCAG CCTGAAAACG T C AC AG GAGG GTGACTACAT TCTGCGCGTT 300 

T CAT AT GT AG GTTACACTAC C C AC G AC G AA AAAATATCTC TTAGAAACGG GCAAACCATT 3 60 

ACGCTCAAAG ATATATCCAT GAACGAAGAT GCCCGTCTTC TACAGAGTGT GACGGTGCAG 42 0 

GCTAAAGCGG CAGAGGTCGT G GT AC G C AAC GATACGCTCG AATTCAATGC C G GAT C C TAT 4 80 

15 AC C GT AGCAC AGGGAGCTTC TAT C GAGGAA CTGATCAAGA AGCTACCCGG AG C AG AG AT C 54 0 

GGATCCGATG GGAAGATCAC CAT C AAC G G C AAGGACAT T A GCAAGATCCT TGTCGATGGC 60 0 

AAAGAGTTTT TCTCCAAAGA TCCACAGGTG GCAATAAAGA ATCTTCCGGC CGATATGGTC 660 

AATAAAGTAC AGGTACTGAA CAAACT GAGC GAGCTGTCGC GGATGAGCGG TTTCGATGAT 72 0 

GGAGAAGAGG AGACCGTAAT CAACCTGACG GTGAAGCCCG AAAAAAAGAA AGGCCTCTTC 7 80 

20 GGAACGCTTC AGGCCGGCTA CGGTACCGAC CAAC GCTAT A TGGCCGGAGG GAACGTCAAT 84 0 

CGGTTCGATG GAAATAAGCA ATGGACATTG ATCGGTAGTG CGAACAATAC G AACAAT AT G 900 

GGCTTTAGCG AGATGGACAG CGAGATGGGA TCCATGACCT TCTTCTCTCC CCAAGGCGGT 960 

GGTCGACGCG GCTTCGGCAA TAGTGGAGGT GTTACGTCTT CGTCGATGCT GGGCGGCAAC 102 0 

TTCAGTGTCG AATTCTCCTC TGCCCTTAAT ACAGGAGGCG ATGCACGCTA** CGGATACAAC 1080 

25 GACAAGGCCA TAG AG AC G AC CAAACGCGTG GAAAATATCC TCGCCGAAGG GAATACTTAT 114 0 

AT GGACGAAA AT AT ATT G GA ACGCTCTTTC TCTCACAATG GTCAGGCGCG ATTTAGGATG 12 00 

CAAT GGAAAC CGTCCGAACG TACCGAAGTG GTATTCGAGC CGGATCTTTC GAT AT C C AAG 1260 

ATCGATGGGT TCTTTAACGA CACATACGAG AC G AAAGAT G CCACCGGAAT CTCTATCAAC 132 0 

AAAGGTTCTA TCCACCAAAC TACACAAGGA AACAACTTCA GACTGAACGG AGAATT GGAT 1380 

30 AT C AGT C AC A AGCTCAACGA CGAAGGCCGT ACGATCAGTG CCTCCGTCAG TGGCGGTCTG 14 4 0 

ACC GACGAAG AC GGAGAT G G CAT AT AT C AG GCTGTGCTCC AAAGCGTGGA GACGAATCAA 1500 

AAGCAATTCA AC GACAACT C CAACCTGCAA TATCGGCTTC GCCTCTCGTA TGTGGAACCG 1560 

TTGGGTAAAA ACTACTTCGC ACAAGC GATT CTGAACAGAC GTTTCTCCCG TCGCAATTCG 1620 

GAT C GT GAGG TGTACCGACT GGGCGATGAC GGGCAATACT C CAT ATT AGA CAGTCAGTAC 168 0 

35 GGACTCTCCT ACAGT AAC GA GTTCACCCAG TATCGCATCG GACTCAACCT C AAG AAG AT T 1740 

GCCAAAACGT GGGACTACAC CGTAGGATTC AATGTGGATC C CAAC AG AAC TGTCAGCTAT 18 00 
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15 
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CGGAGCGTAG 


CCGGAGTAGA 


GCAG GACAAA 


CTGGCTTTCA 


ATCGTGTCAA 


TCTCTCCCCG 


1860 


ATGCTCCGAA 


TCAACTACAA 


ACCGAGCAGG 


ACTACCAACC 


TCCGAGTGGA 


CTACCGAGGA 


1920 


CGCACGACAC 


AACCATCCAT 


CAATCAGATC 


GCTCCCGTTC 


AGGACATCAC 


GAATCCGCTA 


1980 


TTCGTGACGG 


AAGGCAATCC 


CGGTCTGAAG 


CCGAGCTATT 


CCAACAATGT 


GATGGCCATG 


2040 


TTCTCGGACT 


TCGATGCCAA 


AAGTCAGCGA 


GCTTTCAACA 


TTGTTTTCTT 


CGGCAACTAT 


2100 


ACATTCGACG 


ACATCGTCCC 


CAATACGCAC 


TACGATCCGT 


CTACAGGGAT 


CCGTACCACT 


2160 


CGTTACGAAA 


ACGCCTCCGG 


TACGTGGCAA 


GCGAATCTTC 


ATGGGACACT 


ATCGCTTCCA 


2220 


CTCAAGAACA 


GGGCATTTTC 


TTTCAGGATG 


TCCTTGTTCA 


ACAGGTTGGC 


CGAAGGACAA 


2280 


AGCTTCATCA 


ATGACGATAA 


GAACAAAGCT 


CTCTCTTTCC 


GAACGAGGGA 


ACGCCTGACG 


2340 


CTGACCTATC 


GCAACAATTG 


GAT C GATACG 


AGTATCGGTG 


GCAATATCGG 


ATTCTATATG 


2400 


GCGAATAATA 


GTCTGAGCGG 


ACAGAAAGAT 


TCTCGCACAT 


ACGATTTTGG 


CGGCAATTAT 


2460 


CAAGTTGCCC 


TAACGCTTCC 


CTATGGATTC 


CGTATCGACA 


GCGATGTTGA 


ATACAATAC G 


2520 


AACTCCGGTT 


ACAGCGGAGG 


ATTCAGTCTG 


GACGAATGGC 


TTTGGAATGC 


TTCGCTTTCA 


2580 


TACAGCTTCC 


TCCGTGACAA 


GGCCGGTACA 


CTGCGTGTCA 


ATGGCTATGA 


CATCCTCGGT 


2640 


CAGCGGTCAA 


GTATCAGCCG 


TTCTGCTTCG 


GC CAT CAATA 


TAGAAGAGAG 


CATGTCCAAT 


2700 


ACGATCGGAC 


GCTACGTGAT 


GGTGGACTTT 


AT CTAC C GAT 


TCAACGCCTT 


CAGTGGTGGT 


2760 


GGATCTCGCA 


G C GAT CAT C A 


GCGTGGCAAT 


ATGAATCGTC 


CGGGCCCACC 


TTTCGGCGGT 


2820 


GGCAGACGAC 


CGTCC 










2835 



(2) INFORMATION FOR SEQ ID NO: 23 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2370 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 
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(ix) FEATURE : 

(A) NAME/ KEY : mis c_f eature 

(B) LOCATION 1...2370 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 

CTCTTGTTTT CTTCTCCTTT ACCCCGAATG GATCGTCCTA AGC CT T CAT A TATTGTTCGA 
AT AG C AG C C A TTCTCTGCTT GTTTGTCGGC AGGCCTTTGT TTGCGCAGAG CTATGTGGAC 
T AC GT C GAT C CGCTGATCGG GACGCTAAGT TCTTTTGAGC TGAGTGCGGG CAATACCTAT 
CCGGTGATCG GTTTACCGTG GGGAAT GAAT AGCTGGACAC CGATGACCGG TGTACCCGGT 
GACGGCTGGC AAT AT AC CT A CTCGGCACAC AAGATTCGCG GATTCAAACA GACCCACCAA 
CCCAGTCCTT GGATCAACGA CTACGGCCAA TTCTCCCTTC TTCCCCTTAC GGCACCGCAG 
AAGC CAT CAT CGAACGACTC CATAGCT CT G ACTAAATGGT GCAAGCAACT CTTTTCGGAC 
GAACAGACCT CGTGGTTCTC GCACAAAGCG GAG AC G G C GA C G C CAT AC T A TTATAGTGTC 
TATTTGGCCG ATTACGACAC ACGCGTGGAG ATGGCTCCGA CCGAGCGTGC AGCTATCTTT 
CGCATACGTT ATTCCGGCAA T AC C GAAAGT GGCTCCGGTC GATGGCTTCG TCTTGATGCC 
TTTACCGGTG GTTCGGAGAT TAGCATCGTG GATCCTCACA CCGTAGTGGG CATATCTCGC 
AAGAAT AG C G GAGGTGTGCC GGCTAACTTC GCCTGTTATT TCATCCTGCA GTCCGATACT 
CCTATGGCCG ATGTCCTGCT TGAGACAGAT ACCGGCAAGT CAGACGAAGG CACAAGGGCA 
TGGGCAGCCT GTCGCTTCGA TTCGCAAGAA GTTACCGTCC GGGTGGCATC TTCTTTTATC 
AGTGTCGAGC AGGCCGAAAG AAATCTTGCG GAAGTCAAAG GGCAGAGTTT CGACCGGATC 
AGACTTGCCG GTCGCGAAGC TTGGAATAAG GTGCTCGGAC GCAT AC AT GT GGAAGGAGGA 
AC GAAGGAT G AGCGCACTAC ATTCTATTCC GCACTCTATC GCTGTCTGCT- TTTTCCCCGT 
CGCTTCTATG AGGAGGATGC TTCCGGCAAT TTTGTGCATT ACAGCCCCTA CAAT GGAGAG 
GTACTTCCCG GTTATCTCTA TACCGATACC GGATTTTGGG ACACTTTTCG AGCCCTTTTC 
CCCCTGCTCA ATCTGCTGTA TCCCGATGAA AACATTAAAA TTCAGGAAGG TCTGCTGAAT 
GTATATCGCG AGAGTGGCTT TTTCCCCGAA TGGGCCAGTC CGGGCCATCG GGATTGTATG 
ATAGGCAACA ACTCTGCTTC TGTTCTGGCG GAT GC CT AC C TCAAGGGTGT TCGGGTAGAA 
GAT AC C C GT A CACTGATGAA CGGACTCTTG CAT GCTACGA AAGC CGT CCA TCCGAAAATC 
TCCTCCACGG GTCGCAAAGG TTGGGAGTGG TACAACTCCT TAGGTTATGT TCCGGCTGAT 
GCAGGCATCG AC GAAAGT GC TGCCCGTACG CT C GAAT AT G CTTATAACGA TTGGTGCATC 
CTCCGACTGG GGCGCACATT GGGTTGGGAT AGAG CT GC AT TGGACACGTT GGCTCATCGT 
TCGATGAACT AT CGT CAT CT GTTCGATCCG GAAACCAAAC TCATGCGCGG TAGAAATCAG 
GATGGTAGTT TCCGGACACC TTTTTCCCCT TTCAAATGGG GAGATGTATT CACGGAGGGC 
AATGCCTGGC ACTACACTTG GTCGGTCTTT CATGATGTGC AGGGGCTTAT CGACCTGATG 
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25 



30 
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GAGCAACAGC GTCGCGCTAA TCACAGTAAT 

(2) INFORMATION FOR SEQ ID NO: 24 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1356 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivals 

(ix) FEATURE: 

(A) NAME/KEY: mis cofeature 

(B) LOCATION 1...1356 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24 



GATTCGGTAT 




TCCTATGTTC 


1800 


GAAAT C AG A G 




AGCGGATATG 


1860 


C AG CAT A TRA 


U 1 (a PA 


TAATCATGCC 


1920 


CGCGAAGTGA 


TGGGGCGGCT 




1980 


AACGGACAfIA 


rp rp /"« y-» « pp m — . 
L,l 1 L-CjCjCTTG 


GTACGTTTTC 


2040 


GCTACGGATC 


AGTATGTGCT 


CGGTTCGCCG 


2100 


GACGGACACA 


AAACGGTGTT 


GCATGCTCCG 


2160 


TCGATCAGCG 


T AGAAG G AAA 


AGAATGGAGC 


2220 


TCTTCTGCAT 


CCATTCAATG 


GAT GAT GGAC 


2280 


GAAAGT GACA 


GACCTTATTC 


CTTCTCCACG 


2340 








2370 
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CTAATAATCG AAAAGGAAAT GAAAACAACA 
AT GAT GT C AG GTGTATTGGG CGGAAACGCA 
TTTATCAGTA AT GAG C C T AA CGTCAAGTAT 
GGAACAATCT ATGTAACCGT GACAAAAAGG 
5 CAATTGGGTA TGTACTATCG ACCATTAGGT 
TATTTTGATG ACAAGATAGT TGCGGATATT 
TGTACGACTT CTTCTCGAGA TCAAGAGTAT 
AG GT GTAT AT TCAAAACTTC TGTGTCTACA 
TCGACAGGGG T G AC T TAT AT AGGTACCAGG 

10 TTCGAGTTCA ACACTATCTA TGAAGACTCT 
ACGAATAGTG G CAC CAT CT A TCTGGCATTG 
CTTACTTATC AAAACGGTGA GTTCGTCGAT 
ATTGCATCCA TGTGCTCTAA TAAAGAAGGT 
GGATTTATGA GTGGAACCCT T G C GAT C AG A 

15 GGCGGAGATA TACAGAATGC GATCGTTCAA 
GCTTGTGAAG TCTTCGGGAC TCCTAACGGA 
GACGCATCTG TCTTTGATTT TGAGTGGTAT 
GACACTTTCT TCTATAGCCC TTGGGACAAA 
CTCAGGAGTA AAGAGTCTTT TATAACCTCT 

20 GAT GT C TATA CTTTGGCCGG GAAGATAAGG 
TTGCTTTTCG ACCTGGCTGG CAGGATGGTA 
TCGGACATAG ATACTAACGG ACTAAAGCGA 
TCTTCCGGAC AGGTATT CAG TCATAAG GT G 

25 

(2) INFORMATION FOR SEQ ID NO: 25 



GTTCAACAAA 


TTATTCTGTG 


CCTGGCTTTA 


60 


CAGAGCTTTT 


GGGAAGAAAT 


AGCTCCTCCT 


120 


ATAATTCCCA 


ATATGGGGAT 


T GAT T C AAAG 


18 0 


ATTCAGCAGG 


GAGCAAATTA 


TACTTCTGAG 


24 0 


GATAATGAAC 


AGTGGTGGAA 


AC AT GAT C C G 


300 


CAG AC AG AT G 


CAT AT GGCAG 


AGTTTATGTA 


360 


CAACTTTATA 


TAAACGAGCA 


GAACGAATGG 


420 


TAT GAGCAT G 


GTATGGCTGT 


TTTTCGCTCT 


480 


CAT CAC AT C T 


TCGCATCAGG 


TGTAAATGAT 


540 


ACAC CT AT GA 


GCTGTCGCTT 


TGCAGAGGCT 


600 


ATGCATGAAA 


CCACAATGTC 


TACGACTATC 


660 


AT CT C GGAAA 


GT GAATT GAG 


TAACTCGATT 


720 


GATATAATAG 


CTCTTGTTAC 


T T C AT AT ACA 


780 


AAAGCAGATG 


AAGGCAAATG 


GCAACTT GTT 


840 


AATATATGCA 


T GAT G G AC GA 


CAACAAGATT 


900 


GTAGATGGTC 


GGACAAGGGT 


TTGTGTTTCT 


960 


GAAG AT G AAA 


TATACGGAGG 


C CT GAT AT T T 


1020 


CTTCTTTATG 


CGAAATTTGG 


TGGGATTATG 


1080 


TTCATTTCTC 


CGACAGTTGT 


ACAAGGAGTG 


1140. 


ATCGAAAGTG 


AAACTCCGGT 


GTCTGAGGTG 


1200 


CTTCGGCAAA 


C C ATT GAT AA 


T AAAAT CT AT 


1260 


AGCGGTATTT 


ACGTAGTCTC 


GGTGCGGCTC 


1320 


CAGGTA 






1356 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 993 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 



35 



(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
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15 



20 



25 



(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE : 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1. . . 9 93 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 



GGCCTGTACG 


GTATGTCTGT 


TGTACCTATA 


ATAATTTATC 


TTTGCGGTAT 


AT C AAAT TAT 


60 


GCAAGACTCA 


TGATAATCCG 


GTGTCTTATC 


CGTCGTCCGA 


GAACCGTCCT 


GTTCGGGTTG 


120 


ATATTCGTGG 


TAGGTCTTTT 


CTCTGCGATG 


GCGCAAGAGA 


AAAAGGATAG 


TCTCTCTACG 


180 


GTTCAGCCAG 


TGCCGAATAG 


CAGCATGGTG 


GAGCAGACCC 


CTCTTCTCTC 


CAT T GAT C AC 


240 


CCCGTCCTGC 


CCGCTTCTTT 


TCAGAATACC 


CGTACACTGA 


AAAGGTTTAG 


AGACAAACAT 


300 


CTTTCCGATG 


CTTTGCTCAA 


TGGATTGAAG 


CCTCATCGCT 


CATCTTTGCA 


ATTGAATGAG 


360 


GAACTCAACT 


TCGCGGCAGA 


GCGTCGGGAT 


TTCGTTTCTC 


CCCTCTTGCA 


AACTCGCCAC 


420 


GCTGCCGGTG 


TCCTTTCATG 


GCGACCGACC 


GAT AG GAT GC 


ATTTTTATAC 


ATCGGGCAAT 


480 


ATCGGTCTTG 


GCCATGATTT 


ATTGACCGGT 


GTGCGCAAGG 


ACTTCGGATG 


GAATGCTGGT 


540 


GCCGACTTCT 


TGCTGAGTCA 


AAATCTTACG 


GCACATGTCC 


AAGGCGGTTG 


GCAGCAGAAT 


600 


TTCGGCTTTA 


TACCTATGAC 


GGCTGTCAAT 


GGCCAACTGC 


GTTGGCAAGC 


CACCGAGAGA 


660 


TTGAGTTTTA 


CCACCGGTAT 


CGATTATCGA 


CAGGTACAGT 


GGAATGCTTT- 


C GATAATAGA 


720 


ACGTTCTCGC 


TTAAAGGAAG 


TGCTCGATAC 


GAAGT GATGG 


ACAATGTCTT 


TGTCAATGGA 


780 


TTTGGCAGCT 


ATCCTCTCTA 


CAGCAGTACG 


CGCTCAGGAC 


TCAATATGGC 


TGTTCCGATG 


840 


CATGGATTCG 


GCCCTCAGTA 


CGGTGGATCG 


CTTGAGCTGA 


AAGTCTCCGA 


GCGATTCGGC 


900 


TTTGCCGTCG 


GTATGGAGCG 


C GAATACAAT 


ATCTGGACTC 


GTCGGTGGGA 


AACGCATTAC 


960 


TTTGCTTATC 


CTGTATTCTA 


TGGCGATAAG 


AAG 






993 
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(2) INFORMATION FOR SEQ ID NO:26 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 744 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

5 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyr omonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 
15 ( B ) LOCATION 1. . .744 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26 



20 



25 



30 



ATCAAACGAA 


TAGAAATGAA 


AAGGATTTTT 


ACTGTAGCCC 


TTGTGCTACT 


TGCTTCGGTC 


60 


ACTATGGCCA 


TCGGACAAAG 


CCGCCCGGCA 


CTTCGCGTAG 


ATGCCAACTT 


CGTAGGCAGC 


120 


AATCAGAGCA 


TGAAAAGAGA 


C G GAT AT GT G 


TGGGACACCA 


AAATGAATGT 


CGGCCTGCGG 


180 


GTCGGTGCCG 


CTGCCGAATT 


CAT GAT C GGA 


TCAAGAGGAT 


TCTACTTGGC 


TCCGGGTCTG 


240 


AACT AT AC G A 


TGAAGGGCTC 


CAAAAC CGAA 


T G GGATAT AC 


CCGAAATGGT 


TCCTGGTACC 


300 


TAT AT T AC G A 


TGGTTTCCAC 


TCGCTTGCAC 


TATCTGCAAC 


TGCCGATCAA. TGCCGGCATG 


360 


CGGTTCGACC 


TGATGAATGA 


CATGGCGGTT 


TCGATCGAAG 


CGGGTCCTTT 


CCTTGCATAC 


420 


GGTAT ATAT G 


GTACATATCG 


G CAGAAGTT G 


GAAGGAT GGA 


AGC CGAACAA 


CTACAGCACA 


480 


GAGTTTTTTG 


GCCCAACGCT 


TGGTGGCCCA 


ACAAATATCC 


GCTGGGACAT 


CGGGGCAAAC 


540 


ATAATAGCCG 


CATTCCACTA 


TAAGCGTTAT 


TATATACAGA 


TAG G C TAT G A 


ACAT GGATTT 


600 


GTGGATATTG 


TGTCAGGTGG 


AGGTTCTGAT 


ATTCCCCGAC 


TGAACGACAA 


TAGGCAATCC 


660 


TCTTCGACGA 


CCGCTCTAAG 


AGAAAAGGGA 


AATAACGAAT 


ACGCTTATAA 


TCGTGACTTC 


720 


TTCGTGGGCA 


TAGGTTACCG 


CTTT 








744 
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(2) INFORMATION FOR SEQ ID NO: 27 



(i) SEQUENCE CHARACTERISTICS: 



53 



(A) LENGTH: 1689 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE : DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival!, 

15 (ix) FEATURE: 

(A) NAME/ KEY: mis c_f eature 

(B) LOCATION 1. . . 1689 



10 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 

TCGCTAATCA ACAATTCAAG AGACTGGAGG GCTCCTGTCC GATTGAGTCT CAAAAAAAAG 

ACAAAAACTA TGAAGACAAA AGTTTTACGC AAATTCGTGG TGGCGGCTTT CGCCGTCGCA 
ACCCTCTGTC CTCTCGCCCA AGCGCAGACG AT GG GAG GAG ATGATGTCAA GGTGGTCCAG 



*- ~ * J. v-^vj^i^^>y^ 

GACCAAGGTG CCACATACCA AAAGTTGAGG GATTGGGATC CATCGGATGA TTATCAGTTT 
CAAGACTT C G ATATCGTGGT AACGGGTAAG AATGAATCCG ACATCAAGAT TTGGTCGGTA 
GAGCTCATGA ATAAGCCCGG AGGATATAAG AGTAGAGTTG CGGTCTTCAG TCGCGATGCC 



60 
120 
180 



TACAATCAGG AAAAACTGGT ACAAACGAGG ATGAGTGTGG CGGACAACGG ATGGATCTAT 24 „ 
25 GTAATGACCC ACAGTGGATA CGACACCGGC AATAGCAATG TGAAGATCTT CCGCTCCAAA 



300 
360 
420 
480 



AACGCGCAGA ATGCGAAACT CGTGTATAAG GAAGACTTCT CCAATGTGCA GTTGTACGAT 540 
30 GTGGATATAG CCTCCAACTA TCGTTCGCCT TCTTCTCTTA ACAATGGTGG CAACCCTTTT 



GCTTTGGCTT TCGCTTACAC CGGCTTCAAC AATACGCAGA AAATAAGTTT TGTGGACTAT 
GTGTTCTCTC TGAATGGAGG GCAAAATTTC AATAAAAACT TACTCTTCAG TCAAGATGGA 
GAGAAGAAAA TTGACAAGGT GGATCTCTCA TTGGGTAGCA CCTCTGAATC CATGGGTCAC 
AATGCCTGGC CGCTAATGGG TGTGGTATTC GAAATGAATA AACAAGGGGG AAAAAGCGAT 840 
35 ATCGGTTTCT TGTCGAACTT TGTCGACAAT GATCCCGAAT TTCAGTGGTC AGGCCCTATA 
AAAGTGAGTG AAAGCGACAT GTCGTTCAGC OCCAAAATCC AAATGTTGCT GGACGAGGAT 



600 
660 
720 
780 



900 
960 
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AACAATACGA TCAATGGGGA 
TCTGAATATT CGGATTGGGA 
AAAGGAAAAA CTCCGACTAT 
AGTGAGACCA ACTCGGGGCT 
GCCAAAAAAG AAGAGAACGG 
ATTCATAACA AAGATTTGTG 
TACACACCTC AAGTAGACAT 
TATCTGCCGG GCAAACGGAT 
GAAGAC AT C G TAATGCAAGA 
GCTGTGATTA GCCTGCCGAC 
AGAGTAGTCG CTGAGGCTTC 
GCTAAGGGTA CGTACATACT 
ATCGTGGAA 



GAGTTGCCAC AACTTCATGA 
CATTCGGTAT GT AT AT C C C A 
GGATGATCTG GTGGAAGCTT 
GGGGTATGAC AAGAAC G C C A 
TACGAACACG CTGAAATACC 
GAG C G AC AC A TTTACGTATA 
CAATCCGACC AAGGGTCTCG 
CGTTTGGTCT GATACGCAGT 
AGGCAGCATG AAGCTCTACC 
GGCAGCAAAC TGCAAGGCTG 
TTTCTCCGGC AACGAATACA 
CAAGGTCGTA TCCGATACGG 



TT AC GT AC AG C GAT TAT GAT 
AGAAATCGTT CAAGTAT GAA 
TCCTTACAGC TTCGTACCAG 
AT CACT AC CT GATTACATAT 
GCTGGGCCAA TTATGACAAG 
CAT CAT CT G C CAATGCTCTC 
TGTGCTGGTC ATGGGTGGAA 
GGACCCATGC CAACGGTGTA 
CGAATCCGGC TCAAGAATAT 
TT GT T T AC GA TATGCAGGGC 
GGCTGAACGT GCAGCACTTG 
AGCGTTTCGT AGAGAAGCTC 



(2) INFORMATION FOR SEQ ID NO: 28 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1134 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE : 

(A) NAME/KEY: mis c_f eature 

(B) LOCATION 1. . . 1134 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 



GGGATAATAT 


CTGTTCTTTC 


ACATGTGGTT 


GACCGTCCAC 


AATGGGGTGC 


TTCTCCCGAA 


60 


GCTGCTGGCA 


CGCATAGTGT 


GTATTCGATT 


CTACATCCCT 


CCGCCGGTAT 


TATCCGGATC 


120 


AGGTCTATGG 


GTATTATCTC 


AGCCTGCCGT 


ATCGCAATTC 


TTGCCGGCAA 


GCACCCCCGT 


180 


AGCGGTCTGT 


CGAGGGCAAA 


TGTAGGTATT 


CTTTCGTACA 


AT C C AG AAAA 


CACGCCCGAG 


240 


AAAAAGAGAA 


AACTGCAAGA 


AAAAAATGTT 


TTCCTCCAAA 


TCCGGCTCCG 


TCAATCATTT 


300 


AATAATTTGA 


TACCTTCGCT 


CCCATTTAGA 


AT C GAT AAC A 


CAAAAAAAAT 


CACTGAAATG 


360 


AAAAAAACTA 


CTTTGACAGG 


ATCGATATGT 


GCTTTACTCC 


TGTTTTTGGG 


TCTCTCGGCC 


420 


AATGCCCAAT 


CGAAGTTAAA 


GAT C AAGAGC 


ATT GAG GC AG 


CTACCACTTT 


CAGTTCGGCC 


480 


ACGGCCGGAA 


ATGGTTTTGG 


TGGCAATATC 


TTCGGCATGG 


ACATGAGCAT 


ACGGAT GAGG 


540 


GTACACCACA 


GCATTCTGCC 


CGAAGGGTTG 


GATTTTTCGG 


TAGGAATACA 


TGAAAGAAGA 


600 


GCACACTGGG 


AAGAGGCCGG 


AAGTCCGAAG 


CTCATGTATA 


CGAATGTCCC 


AAGT AT C ATT 


660 


GGTATTGTTG 


AAAAGGTAAT 


AGTCTTCGAA 


GAC GCAGAAG 


ACTTTTTTGA 


CAAAAAAGCT 


720 


CTCGGCCGCT 


TCCTCATCAG 


TTTGGGGATA 


TCCTATACCA 


AGCATCTGGG 


AG C GT ATT G G 


780 


GGATGGACCA 


ATGACGCCCA 


TATTCTTTTC 


T C AC C GAT AC 


CCAAGAGCAA 


GGTCCACTAT 


840 


GACAC CTACA 


CAAGAGCTGG 


CAGTGACCTT 


GTACTT CAGT 


CCGAAGATGT 


TGCCACAGTG 


900 


AGCAATGGCT 


TTTCACCGGG 


GATCGGACTC 


AAAAGTTCTA 


TTTGGTGGAA 


AATGCCCATC 


960 


AAGAGCAAAT 


ATGATTTTCG 


CCTCGGTTTC 


AGCCTGGGCT 


AT GAGT AT CT 


GAACCTGCTA 


1020 


TATCCGTATC 


GTAATTT CAA 


GCTGGATGGA 


AATAAGCCGC 


TTTCAGCACT 


ATCTCCTCGC 


1080 


AT G AAC C AC A 


TCGGCCATGT 


GGGCTTCAAC 


TTTACCGTGG 


GTCTTTGGAC 


TAAT 


1134 



(2) INFORMATION FOR SEQ ID NO: 29 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 858 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
30 (D) TOPOLOGY: circular 

Ui) MOLECULE TYPE: DNA (genomic) 



35 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



5 (ix) FEATURE : 

(A) NAME/ KEY : mis c_f eature 

(B) LOCATION 1 . . .858 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 9 

10 



ATTTGTGGCA 


GTAAAATGAA 


TCTATCGGGA 


TTACAATCTT 


TC ACT AT GAT 


G AAAT C CAT G 


60 


CGCAGCGTGC 


TGTTGCTACT 


CTTTCCATTG 


TCTTTGATCA 


CTGCTTTGGG 


CTGTAGCAAT 


120 


AACAAAGCTG 


CCGAATCGAA 


GTCTGTCTCT 


TTCGATTCGG 


CCTATCTCGA 


ACGCTACATC 


180 


CCTCTGCGGG 


CAGACATAGA 


TACGCCATCG 


CTGCATGTGA 


T GAT CAGCT A 


CGTCTATCCT 


240 


TCGGGAGATG 


AT AT G C T C AC 


AG AG AT T T T C 


AACGGTTTGC 


TCTTCGGCGA 


CAGCCTGATG 


300 


GATTCCTCTT 


CGCCGGAGAA 


TGCCATGGAA 


GGCTATGCAC 


AGATGCTGGG 


AGAAGACTAT 


360 


CGCTCTAACA 


ATGCCGAAGC 


CAATCTGCAA 


GGGCTTCCTT 


CTGACCTTTT 


GGACTATATC 


420 


TACAAGCAGG 


AAAAT AC CAT 


CGCTTATTGC 


GAT AC G GGAT 


TGATCTCCAC 


GCGCATCAAT 


480 


AC AT AT AC T T 


ACGAAGGCGG 


TGCACATACG 


GAGAATACAG 


TCCGGTTTGC 


CAACATCCTT 


540 


CGCACCACCG 


GCAAGGTGCT 


CGAAGAGCGA 


GAT AT AT T C A 


AGATCGACTA 


TGCGGAAAGG 


600 


CTGTCCGCAC 


T CAT CAT AGG 


ACAATTGGTG 


CACGATTTCG 


GCAAGACCAC 


ACCTGCCGAA 


660 


TTGGATGCAA 


TAGGTTTCTT 


CAACGCAGAA 


GAAATACAGC 


CCAATGGCAA 


TTTTATGATC 


720 


GAT G AC AAAG 


GTCTCACATA 


CTGTTTCAAT 


GAGTATCAGA 


TAGCTGCTTA 


TGCCAGAGGT 


780 


GCTGTCTATG 


TCCGTCTCGG 


ATATGACGTA 


TTGGCTCCTT 


TGCTAAGGGA- 


TGATTCCCCA 


840 


CTAAAGCGTT 


ACTTGCCG 










858 



(2) INFORMATION FOR SEQ ID NO: 30 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1236 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY: circular 



35 



(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

5 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 
10 (A) NAME/ KEY : miscjeature 

(B) LOCATION 1...1236 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 



15 



20 



25 



30 



35 



GGAGAGT AT C 


CTGCAAACAG 


TAACGACAAA 


AAAGAGATGG 


TTAT GAAGCT 


GAT T AAAAG A 


60 


AGTTTGCTCC 


TGCTTGGAGC 


GGTACTGCTG 


ATTACGCTTC 


CTGCGTACTC 


GCAGAATGAT 


120 


GACATCTTCG 


AAGAT GAC AT 


C TAT AC AT C G 


CGAAAAGAAA 


TAC GTAAACA 


AAAC CAAGTT 


180 


AAAGACTGGC 


AAAACCAAGA 


GGAC GGATAC 


GGCGACGATA 


CGGAATATAC 


AGTGGCTTCC 


240 


GATCGGGACA 


TTGACGCCTA 


CAATCGTAGA 


GATGGCCAGT 


CCTACGATGG 


GAAAAAGTTG 


300 


TCCAAAGACA 


AGAAAAGAGA 


CTCCACTCGT 


TCTTCTGTTC 


CCGGTCGCTA 


TAGTCGCCGC 


360 


TTGGCTCGAT 


TCTATAAGCC 


GAAT AC GAT C 


GT CAT TT CAG 


GTGCCGACAA 


T GT AT AT GT A 


420 


ACT GAT GAT G 


GTGAGTATTT 


CGTCTATGGA 


GAC GAATACT 


AT GAT GAC GC 


GTCGTCTGTA 


480 


AACATTTACA 


TCAACAGTCC 


TTGGTGCGAT 


CCGTTCCCTT 


ATAC GT CAT G 


GTATCCATCT 


540 


TTCTCCGGCT 


GGTACAACTA 


TACGTGGAAC 


TATCCATGGT 


TCTACTACGG- TAGCCATATC 


600 


GGATGGGGCG 


GTTATT AC CC 


CGGATATAAT 


TGGTATTGGA 


GCTACTACTA 


TGATCCTTTC 


660 


TACAATCCCT 


ATGGAATCGG 


TATGGGTTGG 


GGATATCCTT 


ATGGCTGGGG 


CAGCTATTAC 


720 


GGTTGGGGTG 


GCTATCCGGG 


AGTGATACAT 


CACTACCACC 


ACTACCCCAA 


GAAGACCTAT 


780 


TCCAATGGTC 


AGCATTCCGG 


AGCTTACTAT 


TCTTATGGCC 


GAC C GAAT CG 


TAT CAAAGGT 


840 


GGAACGTCCG 


GTGCCAAACT 


TGGGACAGGA 


CGCTACGATA 


GAATTCAAAA 


TTCGTCTTCG 


900 


CAAAAAAATA 


AGTTCGGATT 


GCAGTCGAAC 


AAACCCAATA 


ATAATCTGCA 


AAATGTCAAG 


960 


TCGGGACGTA 


CCGGCCGAGC 


CAATAGAGAC 


C G AAAT AT AG 


AAACGGTAAC 


TCCAAACAAC 


1020 


GGGCAAAAGC 


AGAATCGTCC 


CGTATTCCAG 


CAGAATCAGT 


CCGGCAATGA 


CCGACCGACC 


1080 


GGACGGAATA 


TCCGCAGCGA 


GAGACAGGGG 


GAAAATAACG 


AT AG GACATT 


TTCGACTCCT 


1140 


TCTCGTAGCA 


AT AGTAAC G G 


TGGCTTCTCC 


ACGCCTTCTC 


GCTCTTCTTC 


CGGCTCTATG 


1200 


AGCGGAGGTG 


GCGGACGTAG 


TGGCCGGGGA 


CGCAAT 






1236 
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(2) INFORMATION FOR SEQ ID NO: 31 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 633 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

10 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

15 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 
20 (A) NAME/ KEY : mis c_f eature 

(B) LOCATION 1 . . . 633 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 

25 



30 



35 



AAAAGAAAGA 


GTAT GAAAAG 


AATGCTGCTG 


CTTCTCGTTG 


TATTATTATA 


TGGAATTGCA 


60 


GGCCGATTGG 


CTGCACAAGA 


CGTTATCAGA 


CCATGGTCAT 


TGCAGGTCGG 


AGC GGGAT AC 


120 


TCCGATACGG 


AG AAC AT C C C 


GGGAGGATTC 


ACCTATGGTT 


TCTATTTGGG 


AAAGCGTATG 


180 


GGGAGCTTTC 


TGGAAGTGGG 


GCTGTCCATG 


TACAACTCCA 


CACGTCAAAC 


AGC CAACAAT 


240 


GCAGACTCCT 


TTGCATCGAA 


CGAAGGAGAC 


GGATCTTTTC 


AGGTAAATAT 


GTCTTCTCCG 


300 


AAT GAGAAGT 


GGTCATTCTT 


CGATGCAGGC 


AGTGCCAACT 


GCTATAT GAT 


CGTCGTCGGA 


360 


GTCAATCCTC 


TCCATCTGTT 


TTGGCAGAAT 


AGCCGGCACA 


ATTTGTTTCT 


GGCAGTACAA 


420 


GCCGGCCTGT 


CCAATAAGCA 


CAATATTCAT 


TT CAT CTAT G 


GAGACAAGGG 


AGCCAAAGTC 


480 


AGTAT CTACA 


CCAATTCGAA 


T AC CT AC AT C 


GGTTACGGAG 


CACGTGTAGC 


CTAC GAAT AT 


540 


CAAATTCATA 


AAAACGTGGG 


GGCGGGTGCC 


GCTGTAATGT 


AC GAC CACGG 


CAATAAGATG 


600 


CTTACGGCCA 


TGGCCACGCT 


CTCCACTCAT 


TTT 






633 
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(2) INFORMATION FOR SEQ ID NO: 32 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 2358 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

10 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

15 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 
20 (A) NAME/KEY: misc_f eature 

(B) LOCATION 1...2358 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 



25 



30 



35 



ATCGTTTACC 


TTTGTCACTG 


TATGAACCAC 


AGACGATCAA 


AAACCATGCT 


GACGATCCGA 


60 


AACTTCCTCC 


TCTTTTGTTG 


TCTGTCGCTG 


ATAGCGTTTG 


CTGCCGATGC 


ACAAAGCTCT 


120 


GTCTCTTCGG 


GTAGACGACT 


GACAGAATAT 


GTCAATCCCT 


TTATCGGAAC 


GGCCAATTAC 


180 


GGTACCACGA 


ATCCGGGAGC 


AGTATTGCCC 


AATGGGTTGA 


TGAGCGTTAC 


CCCTTTCAAT 


240 


GTCAGCGGAT 


CGACAGAGAA 


TCGCTTCGAC 


AAAGATTCGC 


GTTGGTGGAG 


TGCGCCTTAT 


300 


TCGGCCGACA 


AT AGT TACT G 


CATCGGTTTC 


AG C CAT GT GA ATCTGAGTGG 


AGTAGGCTGT 


360 


CCCGAACTGA 


GT G GAAT ACT 


GCTGATGGCC 


ACTTCCGGCA 


CATTCGATCC 


TGATTACTGC 


420 


TGCTATGGCT 


CTTCGCTCAG 


TCGAGAATAT 


GCGCGCCCGG 


GAGAATACAA 


GGCTGTATTG 


480 


G AC AAAT AC G 


GTATAGATGC 


AGCCGTGACC 


GTAACCGAGC 


GGACTGCTTT 


GAC C GAATTT 


540 


GCTTTTCCCG 


AAGGAGAAGG 


CCATATCCTG 


CTGAACCTGG 


GACAGGCCCT 


AAGCAAT GAA 


600 


TCGGGAGCCT 


CTGTTCGATT 


CTTAAACGAC 


TCCACAGTCG 


TCGGCAGCAG 


GCTGATGGGG 


660 


ACGTTCTGCT 


ACAATCCGCA AGCAGTTTTT 


CGTCAGTATT 


TCGTACTTCA 


GGTGAGTCGG 


720 



60 



CGACCGATCT CTGCCGGCTA TTGGAAGAAG 
GATTCGACTG CAGGGAAATA TAAGCAGTAC 
GACATCGGTG TCCGATTCTC GTTCAACTGC 
GCCGTTTCAT TCGTCAGCGA AGCCAATGCG 
5 GTGTTCAAAA GTGTCGGAGG GAATCCGGCC 
AT AG AG C GT T GGGAGGAAGC CCTCGGTACG 
AAG AC GAT AT TCTATACCGC ACTCTATCAC 
GCCAATGGAG AATATCCTAT GATGGGCAGT 
TACACCGTGT TCTCTCTTTG GGACAC GTAC 

10 TAT C C GGAGA AGCAGTTGGA TATGGTACGG 
TGGCTGCCGA GATGGGAGCT GTACGGACAG 
CTTATCGTCA TCAATGACAC TTGGCAAAGG 
TATGAAGCCA TGAAAAAAAA TGCTTCTTCG 
AACGACGACT ATCTCACCCT CGGCTTCGTA 

15 TCGCATGCGC TGGAATACTA TCTGGCCGAC 
GGGCATAAAG AAG AC G C AG C TCTATTCGGA 
AATAAGGAGT ATGGTATGCT GTGTCCATTG 
GATCCCAAAC AGGGTGAAAA CTTCGAGCCT 
AACTATGCCT TTTTCGTTCC C CAC GAT AT A 

20 AAGGTTTTTT CGGAAAGGTT GCAGAAAGTC 
GAGC C GGACA TCGCCTATCC TTACCTCTTC 
CAGAAATTGA CCCGGGAGTT GATAGACAAA 
GGTAATGACG ATGCCGGTAC GATGAGTGCT 
CCTGACTGTC CGGGCAGCCC CACCTATACA 

25 ATTCGGCTCA ATCCGCAGTA TTATCCTCAG 
AATCAACCGA CAGATTCCAT TTAC AT C CAT 
CATGGAACAA GGCATAT CAG C CAT GC C GAT 
CTAAGCAATC GTCCTCGA 



CAGCCTCCTA 


TGACAGTGGA 


AGCCCAATGG 


780 


GACGGCTACA 


AGCGT GAGAT 


GAG C G GT GAT 


840 


GATCAGGGGG 


AAAAGAT CTA 


T GTAC GAT CG 


900 


CTCTATAATC 


TGGAAGCGGA 


G C AAG AAG AG 


960 


AAGGCTTTCT 


CCGCTATACG 


CTCTCGCGCT 


1020 


GTGGAAGTGG 


AAGGAGGCAC 


ACCGGATGAA 


1080 


CTGCTGATAC 


ATCCGAATAT 


C CT ACAAGAT 


1140 


GGCAAAACGG 


GTAATACGGC 


TCACGACCGC 


1200 


CGCAATGTAC 


ACCCGCTGCT 


CTGCCTCCTC 


1260 


ACACTGATCG 


AC AT GTAC C G 


AGAGAGCGGG 


1320 


GAGACCCTGA 


CGATGGAGGG 


CGACCCCTCG 


1380 


GGCCTTCGTG 


CT T T C GAT AC 


GGCAACGGCC 


1440 


GCAGGAGCGA 


CCCATCCGAT 


CCGTCCTGAC 


1500 


CCGCTTCGCG 


AACAGTACGA 


CAATTCCGTA 


1560 


TGGAATCTGT 


CCCGGTTTGC 


CCACGCACTT 


1620 


AAACGCTCGT 


TGGGCTACAG 


ACACTATTAT 


1680 


CTGCCGGATG 


GATCATTCCT 


CACTCCTTTC 


1740 


AATCCCGGTT 


TCCACGAGGG 


CAGTGCTTAT 


1800 


CAAGGGCTTG 


CCCGGCTGAT 


GGGAGGAGCA 


1860 


TTCGATGAAG 


GATATTATGA 


TCCGACCAAC 


1920 


TCCTATTTCC 


C C AAGGAAG C 


ATGGCGAACG 


1980 


CATTTTTGCA 


ATGCTCCTAA 


CGGCTTGCCC 


2040 


TGGCTTGTCT 


ATTCCATGCT 


GGGATTCTAC 


2100 


CTGACCTCGC 


CGGTATTCCC* 


■ CCGAGTTAGG 


2160 


GGGGAGTTGA 


TCATTACGAC 


CAATACAGAG 


2220 


ACGGTTTCTC 


TTGGCAATAA 


AACACTTCCG 


2280 


TTGGTGCGCT 


GCGGTCACCT 


CCGTTACGAA 


2340 
2358 



30 

(2) INFORMATION FOR SEQ ID NO: 33 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2859 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(D) TOPOLOGY: circular 
(ii) MOLECULE TYPE: DNA (genomic) 
5 (iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 



10 



15 



20 



25 



30 



35 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : misc feature 

(B) LOCATION 1...2859 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 

ATCCGAATGA GAGTATCCGA TCTCTGTTCC AGACTTTCAT GGTTATTAC C CGTAATCCTT 
GTCGGATTGC TCTGTGCTAC TTTGGTCGCT GCGGAACGTC CTATGGCCGG AGCAGTCGGA 
TTGCACCACC GTCGGCATGC TGCGCTGTCT GATTCTACAG CGAAAGACAC GGTGCCTCTC 
GCAAAACCTA TTCCTGACAG TGCTTTTCGA GATTCCCTTC CTGCCGATTC C AC C G GAT C G 
ATGCGGCAAG ATAGCGTGTA TGACGATGAA TTCGAATTGG AAGATATAGT GGAGTAC GAA 
GCTGCCGATT CCATCGTTTT GCTCGGACAG AATCGTGCCT ATCTTTTCGG CAAGAGCTAT 
GTGAGCTATC AAAAGAGTCG CTTGGAGGCA AACTTCATGT ATCTCAATAC*- CGACAGCAGT 
ACGGTTTATA CTCGCTATGT CCTCGATACG GCCGGTTATC CGATGGCCTT TCCTGTTTTC 
AAGGAT G GAG AGCAGTCGTT CGAAGCCAAG AACTTTACCT ACAACTTCCG CACG GAGAAG 
G G GAT TAT C A GCGGAGTGAT CAC G CAGCAG GGCGAAGGCT ATCTGACTGC C G GTAAGACC 
AAGAAGAT G C CCGACAATAT CAT GT TTAT G CAAGGAGGGC GTTATACGAC CTGCGACAAT 
CAC GAT CAT C CTCACTTCTA TATCAATCTT TCCAAGGCAA AGGTGCATCC GGAGAAAGAC 
AT C GT CAC AG GTCCGGTCAA TCTGGTTATC GCCGATATGC CGCTGCCGAT AGGTCTTCCT 
TTCGGCTATT TTCCCTTTTC CAACAAATAC TCTTCCGGTA TATTGATGCC CAC GTACG GA 
GAGGACAAT C G C T AT GGATT TTATTTGAGG AATGGTGGAT ATTATTTTGC CTTCAGCGAC 
TATATCGATT TGGCATTGCG TGGGGAGATC TTTTCCAAAG GGTCATGGGG CATTTCAGCC 
CAATCGAAAT ATAAGAAGAG GTATAAGTAC AACGGCTCGT TCGAAGCCAA TTATCTGGTA 
TCGAAGTCCG GC GACAAAT A CGTGCCCGGA GACT AC AG CA AGAC CAC C AG TCTGAATATC 
CGATGGACAC ACAGTCAGGA TCCGAAGGCC AATCCTTTGC AAACGTTGTC GGCCAATGTC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
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AATTTTGCCA CCGGGAGCTA TTTCCAGAAT 
CGTACTGCTA CGACACGAAG TTCGGCCGTG 
TTTTCGATTA CGGGTAGCAT G GAT AT CAG C 
ACCTTGCCGA ATCTTTCGAT TAATATGTCC 
5 GTAGGACCGG AGCGATGGTA CGAGAAGTTG 
AGTATCTTGA CAAAAGAGAA AGATTTGCTC 
GGTATGCGTC ATTCCGTACC GATCAGTTTG 
ACTATGGGGG TTAACTACAA TGAGTGGTGG 
GAGGATAAGA AAACATTCCT GCCTTCGGAC 

10 TACAGTCTGT CGGCAGGCTT AT CTAC CAC A 
TTTTCCTTCG GAGGCAATCT CAT TAT GAT C 
TCCTATATGC CGGACTTCAC GAAACGCCGA 
GATCAGAACG GCAAGCT GC A TACGCTGCTC 
GCTCCCTCCA TGGGCAATGC AG GAT CT GT C 

15 AAGATCAAAT CCAAATCGGA TTCGACAGGG 
ACATGGTCTA CAT C CT ATAA TATGTTTGCC 
TCGCTGGCAC TTCGCCTCTC CAAGAGCTTT 
TATTTGACGA AGTATTATGA GGGAGAAGAT 
CTGCGCATTT TTAACGGCAA GGGATT GGCA 

20 TAT AC G C T C A ACAAAGAGTC GCTCAGCGGA 
CGGAGAGATG AAAAGAAAAA CACAGGGGCT 
ATACTTGAGG GAG GAAG AC C GCAAAAT GAA 
CAGGGCGGAG CAGTGGATCA GGATGGTTAC 
TTCGACTATA GTTGGAATAT TGCTACCGAC 

25 CACTACTACC GGGTAACGCA GAATCTGAGC 
TGGAGCTTCG GATTCAATGC GAACTACAAT 
TGCAACGTCA CTCGCGACAT GCACTGCTGG 
G CAT ACAAGT CCTATAATTT C GT CAT AT C G 
TATCAGCAGA GCAATCGTCC CAT CAC GAAT 

30 



TCGCTGAATA 


C CAC CT AT GA 


TGTCAATGCC 


1200 


AGCTATTCGC 


GCAAGTTTCC 


GGGTACTCCT 


1260 


CAGAACAT GC 


G C GAT AC G AC 


GGTGAGCCTT 


1320 


ACGCGTTATC 


CTTTCAAGCG 


GAAGACCCGT 


1380 


AGTGTGGGCT 


ATTCCGGTCA 


GCTTCGCAAT 


1440 


CAGAGCAATC 


TCGTGCGCGA 


TTGGAAGAAT 


1500 


ACTGTCCCTT 


TGTTGGATTA 


TATCAATCTG 


1560 


TACACGAAAG 


GCAT AC GGAA 


GT C GT G GAAT 


1620 


ACGACCTATA 


AATTCCGCAG 


ACT GT AC GAT 


1680 


TTGTACGGTA 


TGTTCAAGCC 


TTGGAAACCT 


1740 


CGTCATCGCT 


TCACGCCCAC 


TGTCAGTTTC 


1800 


TATGGCTTTT 


GGGAGCTTCT 


T GAG CAT AC G 


1860 


TACTCTCCTT 


ATTTCGAGCA 


GATATTCGGT 


1920 


AATTTCTCTT 


TTGACAACAA 


CTTAGAGGCC 


1980 


AT CAAGAAGA 


TCAGCCTGAT 


AGATCAGTTC 


2040 


GATTCGATCC 


GATGGAGCAA 


TATCTCGGCT 


2100 


ACCTTGCGCT 


TGTCCGGTCT 


GTTCGATCCC 


2160 


GGGAAGATCA 


TTCCCTATAA 


GAGCAAC GAC 


2220 


CGCCTGATCA 


GTACGGGTAC 


TTCTTTCAGC 


2280 


TTGATAGCTC 


TTTTCAGTGG 


C AAAAAG GAG 


2340 


ACTCCTCATG 


AAG GAGAC GA 


TGCTGCCGAT 


2400 


J~\\D X. VjVjvJOUO X 




GCGCAACCGT 


24 60 


TTCGCATATT 


CGATCCCATG 


GAGCCTGTCC 


2520 


TACAATAGGT 


ACAAT GT CAAr 


TAAGAT GGAG 


2580 


TTTCGCGGCA 


ATATCCAGCC 


TACACCGAAC 


2640 


TTCGACTTGA 


AGAAAATAAC 


ATCGCTTACC 


2700 


GCTATCTCGG 


C CAGTT T CAT 


CCCTATAGGA 


2760 


GT GAAGAGT T 


CACTCTTGCA 


G GAT CT GAAG 


2820 


ACTTGGTAT 






2859 



(2) INFORMATION FOR SEQ ID NO: 34 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 1803 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

5 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE : NO 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: mis c_f eature 
15 (B) LOCATION 1...1803 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 



20 



25 



30 



35 



AGTAATAGCA GCTCCCACAA GTGGTTAATT TATTATCATA TAGAAAAGAC T AAAAGT AT T 
ATGATCCGCA AGTTGATCTT ACTGCTTGCT CTGATGCCGG TAGCCTCTGT GGCTTTTGCG 
GTACCAACGG ACAGCAC GGA ATCGAAAGAC AATCGTATCC TTACAAG CAT GCAATCCTCC 
TCTTTGAATA GGGATGATGC TCCGGATAAA TGGCAACCTA TGCATGCCAA TTTCAGTATT 
CAGAGCGATA TGCTGCTTTC TACTGCCCAA AAGT C CAAGA ACACCTGGTT CGGCAACTCC 
TAT AT CAT GG GTATAATCAA GAACAATTAT CTGGAGTTTG GTGCCCGTTT- CGAGGATCTC 
TATAAGCCCC TGCCCGGACA TGAACCCGAG ATGNGGCGTG GCGTTCCTCA CATGTATGTG 
AAGGGAAGCT ATCATTGGGC GGAGCTGACT ATGGGAGACT TCTACGATCA GTTCGGTAGC 
GGTATGGTAT TCCGCACCTA TGAAGAGCGC AACCTCGGTA TAGACAACGC GGTTCGCGGC 
GGACGTATAG TACTCACTCC TTTTGATGGA GTGCGTGTCA AGGGTATTGC AGGACAGCAG 
CGTAACTACT TCGACCGCAC GGGCAAGGTA TTCAATTCCG GCCGAGGCTA CCTACTGGGT 
TCTGATCTGG AGCTGAATGT AGAGCGTTGG AGCAGTGCCA TGCGCGACAA T GACTAT CAT 
TTGGCTATCG GGGGATCGTT CGTTTCCAAA C AC GAAG C AG AC GAAGAT AT ATTTGTGGGT 
GTAGGCGAAG ATCGCAAGCG ACTCAACCTG CCGCTCAATG TCCCGATTAT GGGCCTGCGC 
ACCAACTTTC AAAAAGGAGG TCTCGCCCTC TACGCAGAGT AT G G AT AC AA ATACAACGAT 
CCCTCGGCAG ACAATGACTA TATCTACCAC GACGGACAGG CTGCACTCCT CTCTGCCTCA 
TACTCCAAAA AAGG GAT GAG TATCCTGTTG CAGGCCAAAC GTTGTGAGAA CTTTGCTTTC 
CGCAGCAAGC GAAGTGCCCA GCTCACACCG CTTAT GAT CA ACTATAT GC C GGCTTTTACC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 



GCCATTTTTG TCGTATCATT GCAAATTGAA 
AAGAGC AT GA ACAAATTTTA CAAATCACTT 
5 ATGGCAACTG CACTGACCGC TTCTGCACAG 
TCTTCAAGAT CCGCCGGAAC GCATTCATTC 
GATTTCAATC CGGAAGACCT GAT C G C AC AG 
CCCGTCCGGA TAGGACAAGT AATACCGGTG 
ATCTCTTCCA TCGGAGACGT AGAT GT AT AT 
10 GCCATTACGC T TTATT AC GA T GC ATT CAAT 
ACCCCCGACC AT GAAATT GT GTTGGGAGCA 
GCTTTTGCCA CAGAGCCGGT ACCGGGGAGT 
GGAGGGACTT TGCCTGACAT CAAGATCTCC 
GGACGCCCCG TAAC GGATAA CCATTACGGG 
15 GAGATCAACA TCAATTGTCC TGAAGGTGCA 
CAAATGATCA TGGTAAAAGG ACAGTATAT C 
AC G AAAG GAG ACTTTACTCC GCT GAT CAT T 
AATTTCGGTG TAAC GCAAT C CGAGTTGGAT 
AGAGGATGCA GCAATGGTAC ATT GGC CATC 
20 AT GAAGGCTT TCCTCCCGAT CAAAGGTAAA 
GAAGTCCCTC TGCGCTATCG TGTCTATTAC 
TCGAGCGGTG CCGGTATTCA TCATCCGGCC 
AAGACTCCGG CTCTGAATAC ATGGATCTCC 
TTCTATTTCA AAT AC GAT C A AG GT G G T AC G 
25 AAT C AGAATA AG CAC GT G GT CGGCACACTG 
GAGTTCTACG GCAGACTGAA CAGTCATTGG 
CGCATGGACA TCTATCTGGA TCCCCAAAAC 
TATCGTGACG GTTATAAGCC TTTGCCCTCT 
GATCAGGTCG AATTGAATTG GACGGCTGTT 
30 GTCGAATACC AC AT AT T C C G AAAT GGAAAG 
TCGGATGCCA T C G AC G AAAG TAT TAT C G GT 
CGCTTCATTT ATCCCTCGCC GTTGGATGGA 
TCTGCCGACC TTGCCATAGG AGACATTCAG 
CCCGGAGGAG GAGTATCATT AAGCTGGAAA 
35 TTCGGAGAAA GCCCCAATCC TGTGTTCAAA 
G C C GC AC AAA CCCCCAATCC TCCCGTTGGC 



AAAATAACAG AGAATAAGTA TAATT C AGAC 60 

TTGCAGTCAG GACTGGCTGC CTTCGTGTCG 12 0 

ATTTCGTTCG GAGGGGAACC CTTGAGTTTC 18 0 

GACGATGCAA TGACTATCCG CCTTACTCCG 24 0 

AGCCGTTGGC AATCGCAAAG AGAT GGC CGG 300 

GATGTGGACT TTGCATCCAA GGCTTCGCAC 3 60 

CGCCT GCAAT TCAAGTTGGA AGGAGCCAAA 420 

ATTCCGGAGG GCGGACGCCT CTATAT CT AT 48 0 

TAT AC GAAC G CCACTCATCG CCGCAACGGA 54 0 

GAGCTTATTA T GGATT AT GA AGTGTCTCGC 60 0 

GGTGCGGGTT AT AT ATT C GA CAAAGTCGGC 660 

ATCGGTGAGG ACGATTCCGA TTCGGATTGC 72 0 

GACTGGCAGG CAGAGAAGAA CGGTGTGGTG 780 

T CAAT GT GCT CAGGCAACCT G CT CAAT AAT 84 0 

TCTGCCGGAC ACTGTGCTTC CATAACAACC 900 

AAGTGGATCT TCACTTTCCA CT AT GAAAAA 9 60 

TTCCGTGGCA ACAGTATCAT CGGAGCTTCC 102 0 

TCCGATGGTC TCTTGCTGCA ACTCAACGAT 10 8 0 

AAT G GAT GG G ACAGTACGCC CGATATTCCC 114 0 

GGAGATGCCA TGAAGATTTC CAT C CTAAAG 12 00 

TCCAGTGGTT CCGGAGGGAC T G AC GAT CAC 12 60 

G AAG GAG GAT CGTCCGGTTC -TTCTCTCTTC 1320 

ACCGGAGGTG CCGGCAATTG TGGCGGGACG 13 8 0 

AACGAGTATG CAT C C GAT GG CAATACGAGC 14 4 0 

AAT GGC C AG A CGACCATCCT CAACGGAACG 1500 

GTGCCCCGGC TATTGTTGCA GTCTACAGGC 1560 

CCTGCCGATC AAT AT C CAT C AT CTTAT CAG 1620 

GAAATAGCTA CGACAAAGGA GTTGTCCTAT 168 0 

AGCGGTATCA TT C GAT AC GA AGTAAGCGCA 17 4 0 

GTGGAATCTT AT AAG GAT AC GGACAAGACT 18 00 

ACCAAGCTGA AGCCGGACGT AACACCTCTC 18 60 

GTTCCTTTCT TAAGCCAGTT GGTTTCCCGA 1920 

ACCTTTGAAG TGCCCTATGT TTCTGCCGCA 198 0 

GTAGTCATTG CAGACAAGTT TATGGCCGGT 2 04 0 



15 



AC AT AT C C C G 


AAAAGGCTGC 


TATCGCTGCC 


ACTTTCCACC 


TCTTCCTCAA 


/— • 71 ft t~* 7\ 1", TV f* TV 

GAG C AAC AC A 


TCCGATTGGC 


AGGCCGGAAC 


ATGGTTGAGG 


AAT G AC CAT A 


TGCTTTTTGC 


C G GT AT CAGA 


ATCCGTTATG 


T AAG AAAT C C 


GGATAACCTT 


AAC AAC G GAG 


TCTCTTTCGA 


AGGCTACGGA 


AAATATCTGG 


TGGTAAATAC 


CGATGCTCCG 


TATGCTAAGG 


GAACGAATGT 


GGCTCCATTC 


AAC G G AAC AT 


TT AT C G G C AC 


ACAGGATCCA 


ACAGAGAGCG 


AT GAATAC GA 


AATAAAACTG 


GTTGCTCAGA 


TTGAGAATAA 


CAATGCTGTC 


TTCAGCATTA 


AGAACGCTCA 


TATGGTTCAC 


CAGGTTCGTT 


CTTGGAACAA 


CCTCCGCAAT 


GCCGGTACTT 


ATATGCTCGT 


TATGCAGACG 


AAGCAG 
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/-rpfTim TV rp /~>rp 7\ TV 

\j 1 1 I /VI K3 1 AA 


1 uUL.nl n— ■ v_ o V-' 


TCCGGACTCT 


2100 


AAUAtjAAtjA 1 






p 1 fin 

£ 1 u u 


7\rp/-i7\ A T"T"T I 


tat tv arrrCTT 




? ? ? n 


7\ cp r'pr'T 1 A T\ rp 7\ 

A J. bLLlAAlA 


Atj 1 ALAAb L, 1 




9 9 ft n 


TTCTCCATTA 


CL-GG 1 AACjAA 


f 7A r P r P T 1 C 2\ T 1 TAT* 
on ill 1 /\ i 


Z O T v 


ATACCC 1 CGC 


ILl 1 GGCjL, I A 


1A1 bbLI/VlL 


9 zi n n 

Z *i u u 


AAGAT C GAT A 


TGTCGC I 1 G 1 


ALACjCjAlaL-L- 1 


9 /i n 

Z fi D u 


CCCGAATTGG 


TCGGCATATA 


TGTCTATAAG 


2520 


TCCGTCACAA 


CTTATTCGGT 


TTCAGACGGA 


2580 


GTATATAAGG 


GAT C GG G CAT 


TTCGAATGGC 


2640 


GTTGCATATC 


CGTCTGTTGT 


AACAGAT C GT 


2700 


GCTGCCGCCC 


TCTACTCATT 


GGATGGCAAG 


2760 


GGCGTGACAT 


TCAGTGTTCA 


AGGACTTACG 


2820 


GCAAACGGCC 


CTGTGAGCCA 


AAAGATCGTG 


2880 
2886 



(2) INFORMATION FOR SEQ ID NO: 36 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3936 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

25 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

30 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



35 



(ix) FEATURE: 

(A) NAME /KEY : misc feature 



67 



(B) LOCATION 1...3936 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36 



60 
120 
180 



AAGAGGGTAG ATAGCTATGA ATGCAGACAA AAAGCTTGCA AATGTGCAAT TTGTGTGATA 
CAAAAATTTA CTAATGTAAA ACTAAATGAT ATGCGAAAAA TTTTGAGCTT TTTGATGATG 
TGCTCTCTGC ATTTAGGTCT ACAATCTCAG ACTTGGCATG GAGATCCGGA CTCAGTGGCA 

GCCCTACCTT CTATCGGTAT TCAAGAGTCA AGTTGTACCC GAATCACGTT CGAGGTTGTT 240 

TTCCCCGGAT TTTATAGTGT GGAAAAAC GA GAAGGCAACC AAGTCTTTCA GCGCATTTCC 3 00 

ATGCCGGGTT GTGGCTCGTT TGGGAATCTG GGC GAAGCTG AATTGCCTGT TTTGAAAAAG 360 

ATGATAGCCG TTCCGGAATT TTCAACAGCT AACGTTGCTG TAAAAATCAA AGAGACGGAG 420 

ACATTCGACA ATTATAATAT CTATCCTAAT C CT AC C TAT G TCGTAGAGGA GTTGCCTGAG 4 80 

GGGGGGACTT ATCTGGTAGA GGCTTTCGCG AT AAAC AAT G ACTATTATAG CCAAAATGTA 54 0 

AGCCTCCCTT CTACTCACTA TGTCTATTCT CAAGAC GGGT ATTTTCGCTC ACAAAGATTT 600 

ATCGAAGTTA CCCTGTATCC TTTTCGATAC AACCCTGTCC GACAAGAAAT TCTATTTGCA 660 

AAAAAAATCG AGGTTACAAT AACTTTCGAT AATCCTCAGC CACCTTTACA AAAAAACACC 720 

GGCATATTTA ACAAAGTAGC CTCCTCTGCA TTTATTAATT ATGAAGCTGA TGGCAAATCG 7 80 

GC GATAGAAA AT GAT AT GGT GTTCAGTCGT GGTACAACAA C GT ACATAAG CGGAAATGTT 84 0 

GCCAGCAACC TCCCTCAGAA CTGTGACTAC TTGGTTATTT ACGAT GAT AT GTTCAACGTA 900 

AATCAACAAC CACACGACGA AATCAAACGG CTGTGCGAAC ATAGAGCCTT CTACAACGGC 960 

TTTGATGTAG CTGCTGTAAG TATAAAGGAC GTATTGAATA GCTTCCCATC AAATGCCACC 1020 

TCATACATCA ACGAAACTAA ACTGAAAAAT TTCATTCGCT CAGTTTACAA C CAAAG CAAT 1080 

GCGAAGAGGA CTTTAGATGG CAAACTGGGA TACGTGCTAC TGATCGGAAA ACCATTGAGC 1140 

AAATATTTGG CTGACACTGA TAATACAAAA GTCCCAACCT CTTTTATTCA- TAATGTCTCC 12 00 

TTAATTCCAA GTCATCCAAC TTTTGGTTCC ATATGCGCCT CCGACTATTT TTTTAGTTGT 12 60 

GTTTCGCCCC TT GAT ACT GT CGGCGATTTG TTTATCGGTC GATTTAGC GT CACCAATGCT 132 0 

CATGAATTGC ACAATCTGAT TGAAAAGACT AT CAACAAAG AAAT CT CAT A TAATCCTATT 138 0 

GCACACAAAA ATATTCTTTA CGCAGAAGGG AAAGGCTGCG ATGCTCCAAT CTTACGTTTA 14 4 0 

TT CTTAAAAG AAATCGCCTC TGGTTACACA GTCAACTCTA TCTTAAAATC TAATCAGGTC 15 00 

TCTGCAATAG ACTCGATATT TGACTGCTTG AATAAT GGTT CCCATCATTT TTATTTTAAC 1560 

AC T CAT GGAA TGCCGACTGT TTGGGGGATA GGGCAGGGAC TCGACGTCAA TACTCTAACA 162 0 

GCCCGATTGA ACAATACATC TTCGCAGGGA TTATGTACGA GTCTATCATG TAGTTCGGCT 168 0 

GTAGCAGATT CAACTATTAG ATCGCTTGGA GAAGTCCTGA CCACATACGC ACCTAACAAG 174 0 

GGATTCTCGG CTTTCTTAGG AGGAAGCAGA GCCACCCAAT ATGCCGTTTA TTTAGAAGGC 18 00 

CCCTGTCCTC CGTCAGAATT TTATGAATAT TTACCTTATT CTTTATATCA CAATCTCTCG 1860 

ACTGTTGTTG GCGAAATGTT GCTATCATCC ATTATCAATA CTAATTCTGT T GAT AC GT AT 1920 



68 



TCGAAATTCA ACTTCAATTT GCTTGGCGAC 
GAGGTTAGTA ATTGTATTAC ACTACCAAAC 
AAAAATGGTG GCTGCCTAAA AAT AC C G GAA 
TCCATACAAG TCATGTCCGG AGGAACTCTG 
5 GAGACCGGTG CTAACCCCAC CTTTATTACC 
CAGGTAGAGA TAGACAATAT AGACCGACTT 
AAATTTCATT TTGACAGTGT GAAATTCAAC 
GTGGAGATAA GCAATTGCGA ATTT AC CAAT 
CTAAGCGTTG AAAACAGTAT GTTTAGCAGT 
10 ACAAGCTCCA TCACCGGATT AT CT ACAAAA 
ACAGGAAACT TCGCCTACCA TATCACAAAC 
GC CAT CAAGT TAGACAATAT TCCTGAGTAT 
GATGAGGCTC TTGTACTAAA TAATAGTGGC 
CGGAATGTGA TAAAAAACTG TAGGATTGGG 
15 AACCGAAATA AGATCAGTAA CAATCATATA 
TATTTCGATA ATGCTCCTGT AAT CAAT GAA 
ACTTGGCAGC T CTAT T CAT C AAAC GGTACA 
CAGGGGGGAG ATACAGATAC ATGGATTTAC 
GTTTCAAATA ATCACTGGGG CAACAATGAT 
20 CCAGACTTGT TCATTTGGAT ACCTTTTTGG 
AGCTCTGCTG AAGCAGTAGA ATT C CAAACA 
CTTTCGGCAA AAGTGGCTCT CAAGATGATG 
ATAGCTGCTT TGAAGGAATT GTTCAGGATA 
TTGAAAGATT ATTT CAGAT C CAATCCAACC 
25 GCTGATTTCC TGTCTGCGCG AT G C GAT ATT 
TGGTACGAAA ATCGCTTGAA T AGT GAAAT C 
GACCTTGGTG ACATTTATTG GAATAT GCAG 
T T G AAC AT AC TTTCCTGTGA ACAAAGGAAA 
TATTTGTTGT CAACTCTTCC CGAATCAACA 
30 AAATCAAGCC TTGATAAATC CAAGATAATC 
GTAACAATAA TCTACTATAC CGATAACCCT 
AATGGAGCCT C G GC T GAT AT AACCGGGTTG 
ATACAGTT C A ATACAT CCAA CTTTGATCCC 
CAGAAAATTA TAG AT AC GGA AAAATT AC GA 
35 



CCTGCACTAA ACATTATGGC TCATGGCATG 198 0 

AACACCATTA TAAGCAGTCC GATAACAATA 2 04 0 

AAAGGAGTTT TGCATTTTAC T AAT AAT G GC 2100 

GAAAT AGGCA ATCAGGCTAA AAT AT C C G G A 2160 

GTTTACGGCG ATGGTCTTGC GATTAACAAG 222 0 

AACTTGTTTT CTACGCATTC GGTCATGCCC 22 8 0 

AGTGCCCCGC TGTATACAAC GAACTGTATT 2 34 0 

C GAAGT GAC A TTATTTCAAA GAATTGTGAC 2 4 00 

TCGGGGATAA C G GT AT T CAA GCCTATGGCT 2 4 60 

G C AAAGAT T A C C GAC AAT AC TTTTTTTGCG 2 52 0 

ACGCCAGGCT TAACAGCAAC CTCCAATGCT 2580 

T AC AT T T C C G GTAATAAAAT AGTCAATTGC 2 64 0 

AACAGAACGA ACAGACTCCA CAATATCACA 2 7 00 

AGCACGCTTT ATAATTCCTA TGGTATTTAC 27 60 

GGAGTACGTC TCCTCAACAA CAGTTGTTTT 2 82 0 

GAAGATAAGC AGACGTTTAT T T C T AAT AG G 2 880 

TTCCCTCTCA ACTTCCATTA CAACAGCTTG 2 94 0 

AAC GAC AC GT ATACGAATCG CTAT ATT GAC 3 000 

TTGTTTGATC CGAATCAGGT T T T CAAT AC G 3 060 

GATGGATTGC CAAAT GGGAG ATCGGGCAAT 312 0 

GCATTGGACT GTATTGGCAA TAG C GAT TAT 318 0 

GTTGAAACCT ACCCGGAATC CGACTTTGCA 324 0 

GAGAAAAT GT CAGGCAACGA TTACGAAGGC 3300 

ATCATCTCTT C C CAGAACTT- GTTCCCGACA 3360 

GTGTGTGAAA ACTATCAGTC TGCCATCGAT 342 0 

TCCTATCAGG ACAGTGTTTT T G C AGT CAT T 34 8 0 

TTAGACT CAC TCAGAGGGAC TGGTATAGAT 354 0 

TCGCTCGAAA G C CAT C AAAA TGTAAAAAAT 3600 

GGTACTCTCC TGCCTCCATT AGAATGCAAC 3660 

TCTATTTCGC CCAATCCGGC GAAAGCTGTT 372 0 

TCCTGTTCTG TAATAAAAAT AT AT G G AAT A 37 8 0 

CCCAAACATC TATCCGAAGG TTATTACAGC 384 0 

GGTTTCTACC TGGTAACGCT AAATGTTGAT 3900 

AT C AAA 3936 



10 



69 

(2) INFORMATION FOR SEQ ID NO: 37 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2814 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

15 (vi) ORIGINAL SOURCE: 

(A, ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1...2814 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 

T CAGAAAATT ATAGATACGG AAAAATTACG AATCAAATAA TGGCTATCAT- GATGAAAAGT 
ATTGTTTTTA GAGCATTTCT AACGATTTTG CTCTCGTGGG CAGCGATCAC GAATCCGACT 
CCTCAAGAGA TCTCAGGCAT GAATGCATCC TGTCTGGCTG CTCCGGCTCA ACCGGATACT 
ATCTTATATG AAAGTTTTGA GAATGGACCT GTTCCCAATG GCTGGCTTGA GATAGATGCT 
GATGCTGATG GTGCCACTTG GGGAAGCCCA TCAGGCTCTT TCTCTGTACC TTACGGACAC 



20 



30 7rT GCACCTACTC CCATMACGT TCCGGTMCT =»<™CTG 3 6 „ 

30 ATTACACCCA ATATAGAAGG AGCCAAACGG GTCAAGTACT GGGTATGCAA TCAGTATAGT 



ACCAATCCGG AACATTACGC AGTAATGGTA TCGACAAGGG GGACTGCCAT TGAAGACTTT 
GTTTTGTTGT TTGATGATTC CATAACAGGG AAACGGACTC CTCTTGTATG GCGTAGACGA 
ATCGTGGACT TACCGGAAGG GACCAAATAT ATTGCATGGC GACATTACAA AGTCACCGAC 
TCACACAGAG AATTCTTGAA ATTGGATGAT GTCACTGTGT ATAGGTCGAT CGAAGGGCCC 
^CCTGCTA CCGACTTCAC AGTAATCAAT ATTGGTCAGA ATGTGGGACG ATTGACTTGG 
AACTATCCGG AGGATTATCA ACCGGAAGGA AAGGGGAATG AAGAGTTGCA GCTTAGCGGC 



720 
780 
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TACAACATCT ATGCGAACGG T AC AC T AC T G 
TATGTGGACA GCACTTACTC TTTGCGAGAC 
GCCGTTTACG AT G AAAG CAT AGAATCTTCG 
GATGCCATCC TTTATGAAAA TT T T GAGAAT 
5 GACGCTGATG GAGAT GGATT TAGCTGGGGA 
GGCCATAATG GAGGCCATTG CTCCTTGTCG 
ACTCCCGACA ACTATCTGAT TACCCCCAAG 
GTAAGCACGC AG GAT G C CAA TTGGGCAGCG 
GGGACTGCTG TCGGAGATTT C GT CAT ATT G 
10 GGCGCATGGT AT GAAAGAAC CATCAACTTA 
CATTACAACT GT AC C GAT AT ATATTTCTTG 
CCTGCATCAG AGCCCGAACC TGTTACCGAT 
GGACGATTAA AGT GGAATTA TCCTAACGGC 
CCATTGCAGC TTGCCGGCTA CAATATCTAT 
15 GACCCGACTG TTTTGGAGTA TATCGATGAG 
GTGGAATATT GTGTCACTGC CGTTTATAAC 
AAGCTGATTT ATGATTCTCA AT C GGACAT T 
ATTCCTGAAG GCTGGTTGTT GATTGATGCT 
CCTTGGACTA TGTATGGACA T GACAGT GAG 
20 ATGATTGGCG TTTTAACTCC GGATAACTAT 
CTTGTCAAGT ATTGGGTAAG T G C GCAAGAT 
ATGGTTTCTA CTACGGGAAC TGCTGTTGAA 
ACCGCTAAGG CTAACGGTGC AT GGT AT GAG 
TAT AT T G C CT GGCGGCATTA TGATTGCACC 
25 ACGGTTTATC GTTCTACTGA GACTGTTCCC 
ATTGAGAATA ACAAGGGTCG CCTGAAATGG 
ACTGATGATA AAAAAC CAT T GCAGCTTACC 
CTTGTTCACA TACAAGACCC GACTGTTTTG 
GACGGTCAGG TGGAAATGGA AT ATT GT GT C 
30 CAATCGGTTT GCGATAAGCT GAACTATACT 
ACAAGCTTGA AAAT AT AT C C TAATCCGGCA 
CGGAGCAAGT CGACAATCGA GTTGTATAAT 
ACT C ATT C AG AGAAAAC GGA AATCGATGTT 
AAAGTAGTCG GTGGAAATAA AACAACAACC 

35 



GCACAAATAA 


AAGATGTCTC 


CATACTGGAG 


840 


AATCCCTTGC 


AAGT GGAGT A 


CTGCGTTACA 


900 


ACCGTATGTG 


GCACGCTGCA 


TTACGCCACG 


960 


GGACCTGTTC 


CCAATGGTTG 


GCTTGTGATA 


1020 


CACTATTTGA 


AT GCAT AC GA 


CGCTTTTCCC 


1080 


GCTTCTTATG 


TTCCGGGTAT 


AGGCCCGGTG 


1140 


GTTGAAGGAG 


C C AAAC GT GT 


CAAGTACTGG 


1200 


GAACATT AC G 


CGGTGATGGC 


TTCGACAACG 


1260 


TTCGAAGAAA 


C CAT GACAGC 


GAAGCCGACC 


1320 


CCTGAAGGGA 


CTAAATACAT 


CGCATGGCGG 


1380 


AAGTTGGACG 


AT AT C ACT GT 


ATTCGGGACT 


1440 


TTCGTTGTCT 


CGCTTATTGA 


AAACAACAAG 


1500 


TACGAACCCG 


AT AAGACT GA 


T GAT AAAG AC 


1560 


GCAAACGGCT 


CGCTCCTTGT 


T C AC AT AC AA 


1620 


ACTTATTCTT 


CACGAGACGA 


T C AG GT G G AA 


1680 


GACAATATCG 


AGTCCCAATC 


GGTTTGCGAT 


1740 


AT CTT AT AT G 


AAGGCTTTGA 


GGCCGGAAGT 


1800 


GATGGCGACA 


ATGTTAATTG 


G GACT ATT AT 


1860 


AAGTGTATTG 


CATCCCCTTC 


GTACTTACCG 


1920 


TTGGTTACAC 


CCAGACTCGA 


AGGAGCCAAG 


1980 


GCTGTTTATT 


CGGCTGAGCA 


TTATGCTGTG 


2040 


GATTTTGTCC 


TCTTGTTCGA 


AGAGACAATG 


2100 


CGAACTATTA 


CATTGCCTGC 


AGGAACAAAA 


2160 


GATATGTTTT 


TCTTGCTCTT- 


GGAT GACATT 


2220 


GAGCCTGTTA 


CTGATTTCGT 


TGTCTCGCTT 


2280 


AATTATCCTA 


ACGGCTACGA 


ACCCGATAAG 


2340 


GGCTACAACA 


TCTAT GCAAA 


TGGCTCGCTC 


2400 


GAGT AT AT C G 


AT GAGACTTA 


TTCTTCACGA 


2460 


ACTGCCGTTT 


ATAACGACAA 


TATCGAGTCC 


2520 


ATCACATCCT 


TGGATAATAT 


TCAATCTGAT 


2580 


TCGTATGTGG 


TAAGGATAGA 


GGGATTGAGT 


2640 


GCGCTGGGAA 


TTT GCAT ATT 


AAGGGAAGAG 


2700 


TCACGTCTCA 


ATGACGGAGT 


CTACTTGATT 


2760 


GAAAAGGTAG 


AGATAAAGAG 


GCCG 


2814 



71 
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(2) INFORMATION FOR SEQ ID NO: 38 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3753 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE : DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival!, 



(ix) FEATURE: 

(A) NAME/ KEY : n.isc_f eature 

(B) LOCATION 1...3753 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38 



AAATGGAAAT TGGCATGCGC TTTTGATTGC GCGTGTTGTT TCGATCCATT- TGTAGTAACT 
25 AACGAGGTAA TAATAATGAT GAAAC GAT AT ACAATAATTC TTGCAGTTTT TCTTTTATTC 



120 
180 
240 
300 



-A- j- _l i^iuiAHl 

TGCACGGTAT TTACCTTTCA AATAAAAGCT CGCCCTTATG AAAGATTTGC AGATGTAGAG 
AAGCCTTGGA TTCAGAAACA TTCAATGGAT TCTAAATTGG TGCCTGCAAA TAAGGGTAAC 
TTAATTCAAG CTGAAATTGT ATACCAATCT GTTTCTGAAC ATAGTGACTT AGTTATTTCA 
CCTGTGAACG AAATAAGGCC TGCAAATCGT TTCCCTTCGC ATAGGAAGTC TTTTTTTGCA 360 
30 GAAAAT CTAC GGGCATCTCC CCCCGTAGTT CCCGTTGCCG TCGACAAGTA TGCGGTACCG 
GTTGCCAATC CAATGGATCC TGAAAATCCC AATGCCTGGG ATGTGACGCT AAAAAT CACT 
ACTAAAGCGG TAACAGTACC TGTCGATGTG GTGATGGTTA TCGACCAGTC TTCGTCAATG 
GGAGGGCAAA ACATTGCCAG ATTAAAGTCT GCCATTGCAT CGGGACAGCG TTTTGTGAAA 

35 c2l GTTGC CTAAGGGGAC GGCTACAGAA GGGGTGCGTA TCGCTCTTGT -™ eeo 

CATGAGCCTC ATCGCTTATC TGATTTTACC AAAGACACTG CTTTTCTCTG TCAAAAAATC 
CGGGCTTTGA CTCCTATTTG G G GAACACAT ACCCAGGGGG GGCTTAAAAT GGCGAGAAAC 



420 
480 
540 
600 



720 
780 



72 





ATTATGGCCA 


CTTCTACTGC 


TGTGGATAAG 


CATATCATAT 


T GAT GT CT GA CGGGTTAGCG 


840 




ACGGAGCAGT 


ATCCTGTTAA 


AAATGTAACT 


ACTGCAGACT 


TCATTGGCAA AACTGGAAAT 


900 




GCGAATGATC 


CCATTGATTT 


G GT T AT AC AA 


G GAG CAATT A 


ATTTCCCTAC AAATTATGTT 


960 




TCCAACAATC 


CATCTACACC 


TCTTACCCCA 


AATTATCCAA 


CTCATTCTTC TAAAGTTGGA 


1020 


5 


C GGAGAAAT C 


TGCCGGAATC 


CAAATTCGAT 


T AT AGT AAT C 


TGAGTGCAAG GATTACTTTT 


1080 




GATGGTGTTG 


CTGGCGCATT 


GGTCTATGAA 


CCGAGGTTTC 


CTCATCCCTA T TAT TAT TAT 


1140 . 




TTCCCTTGTA 


ACGCTGCTAT 


CAATGAGGCT 


CAGTTTGCGA 


AAAACTCTGG TTATACAATC 


1200 




CAT AC TAT T G 


GCTATGACCT 


GGGAGATTTT 


GCCTTGGCCA 


ACAATTCGTT GAAACTAAC C 


1260 




GCTACAGACG 


AGAATCACTT 


CTTTACGGCG 


ACACCGGCCA 


ATTTAGCTGC AGCGTTTGAT 


1320 


10 


AATATTGCCC 


AAACTATTAA 


TATAGGTATA 


CAGAGGGGGG 


AGGTGACGGA CTTTGTAGCT 


1380 




CCTGGTTTCA 


TCGTTAAAAA 


TCTGACGCAA 


TCGGGAGATG 


TTACTCATTT GCTAAATGTT 


1440 




TCAAATGGAA 


CGGTGCACTA 


TGATGTCTCT 


ACTAAAAAAC 


T G AC AT G G AC TACT GGT ACT 


1500 




ATCCTGAGCT 


CATCAGAAGC 


TACCATAACT 


TATCGTATTT 


ATGCCGATTT GGAT TAT AT A 


1560 




CAGAACAATG 


ATATTCCGGT 


AAATACTACT 


TCTGCTATCG 


GCCCGGATCT TGGTGGATTC 


162 0 


15 


GATACCAATA 


CCGAGGCAAA 


ATTGACCTAT 


ACCAATTCCA 


ATGGCGAACC GAATCAGCAG 


1680 




TTAATTTTCC 


CACGTCCGAC 


GGTTAAGTTA 


GGTTATGGTG 


TTATTAAGCG GCACTATGTA 


1740 




T T GGTAAAT A 


AAGACGGTCA 


AC C CAT AC AG 


GCAAATGGAA 


CAGTTGTCAG TTCCCTAAGC 


1800 




GAGGCTCATG 


TTCTACAGTC 


ACAAGATTTC 


TTTTTGCCCT 


CAGGTGGAGG T CAT ATT GTT 


1860 




CCCAAATGGA 


TAAAGTTGGA 


CAAAACGACC 


GAAGCAT T AC 


AGTACT ATT C CGTACCGCCG 


1920 


20 


ACTAACACGG 


TCATCACTAC 


TGCCGATGGT 


AAACGTTATC 


GTTTTGTCGA AGTCCCAGGC 


1980 




TCCACGCCGA 


ATCCGGGCCA 


AATCGGTATC 


AGTTGGAAAA 


AACCGGCAGG AAACGCTTAC 


2040 




TTCGCTTACA 


AGCTCCTCAA 


TTATTGGATG 


GGAGGAACAA 


CAGACCAACA GAGTGAATGG 


2100 




GATGTGACGT 


CCAATTGGAC 


AGGAGCCCAA 


GTACCGCTCA 


CAGGAGAAGA TGTAGAGTTT 


2160 




G C AAC G AC AG 


AAAATTTCGG 


TTCTCCGGCG 


GTAGCCGATT 


TGCATGTCCG- G AC AAC C AAC 


2220 


25 


CCCAAAATTA 


TCGGTAACCT 


TATCAATAAT 


TCCGACAAGG 


ATTTAGTTGT TACCACAAGC 


2280 




AGTCAATTGA 


CGATCAACGG 


CGTGGTTGAG 


GAT AAC AAT C 


CGAATGTCGG TACGATCGTC 


2340 




GT GAAGT C GT 


CGAAAGACAA 


TCCTACGGGG 


ACATTGCTTT 


TTGCCAATCC GGGCTATAAT 


2400 




C AAAAT GT AG 


GGGGGACCGT 


CGAGTTTTAC 


AATCAGGGAT 


AT GATT GT G C CGATTGTGGT 


2460 




ATGTATCGCA 


GGAGCTGGCA 


GTATTTCGGT 


ATCCCTGTCA 


AT G AAT C AG G TTTTCCAATT 


2520 


30 


AATGATGTGG 


GCGGAAACGA 


GACCGTCAAC 


CAATGGGTTG 


AGCCTTTCAA TGGCGATAAG 


2580 




TGGCGGCCAG 


CACCTTATGC 


ACCTGATACA 


GAGCTTCAAA 


AATT CAAGGG CT AC CAGAT C 


2640 




ACGAATGACG 


TGCAGGCACA 


GCCTACGGGA 


GTTTACAGCT 


TCAAGGGTAT GATTTGTGTG 


2700 




TGCGATGCCT 


TCCTGAATCT 


GACACGCACG 


TCCGGTGTCA 


. ACTACTCGGG CGCCAACTTG 


2760 




ATCGGCAACT 


CATACACTGG 


AG C CAT C G AC 


AT CAAGCAGG 


GTATTGTCTT CCCGCCGGAA 


2820 


35 


GTCGAGCAGA 


CGGTGTATCT 


GTTCAACACG 


GGAACACGCG 


; ACCAGTGGCG TAAGCTTAAT 


2880 




GGAAGCACGG 


TTTCAGGCTA 


. TCGAGCCGGT 


CAGTACCTCT 


1 CTGTACCTAA GAATACAGCG 


2940 
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GGTCAGGACA 


ATCTTCCGGA 


TCGTATTCCA 


X ^-OLrt.X bL/\l 1 


L,L, 1 111 1 CjCjI 


GAAG AT G C AG 


3000 


AACGGAGCGT 


CTTGTAfGTT 

w X X W X 1 k V- \J J. J. 






I GCTCAAGAA 


CAC GAC T GT A 


3060 


AACAACGGTA 


ATGGTACGCA 




*-<j/\X CCGGCA 


ACTCCGGATC 


GG CGAAT AT G 


3120 


CCGTCACTTG 


T GAT G GAT GT 


X \_ X X VJ*J X AA\^ 




ACCGTTTGTG 


GATCTTTACC 


3180 


GATGGGGGTC 


TTTCTTTCGG 


ATTCGACAAC 


GGCTGGGATG 


G I CGCAAGCT 


GAC T G AAAAA 


3240 


GGTTTGTCAC 


AACTTTATGC 


GATGTCTGAC 


ATCGGTAATG 


A I AAA! TCCA 


GGTTGCAGGG 


3300 


GTTCCGGAGT 


TGAATAACCT 


GCTGATCGGC 


TTCGATGCGG 


7\ r P7\ Ar'PATpr 1 
/A. 1 /\ACj> LtA 1 (jrG 


T C AAT AC AC G 


3360 


TTHf^Af^TTTn 

± ± \J yjfWD X X X V7 




TCATTTTGCG 


AAAGGGGCTG 


TTTACCTGCA 


CGATCTTCAG 


3420 


TCAGGAGCCA 


AACACCGTAT 


TACGAATTCT 


ACGTCGTATT 


CATTCGATGC 


CAAGCGGGGA 


3480 


GATTCCGGGG 


CTCGTTTCCG 


C TT GT CAT AT 


GGATGTGATG 


AGAACGTAGA 


TGATTCGCAT 


3540 


GTCGTGAGTA 


CAAATGGCCG 


T GAAAT TATA ATTCTGAATC 


AAGATGCTCT 


TGACTGCACT 


3600 


GTAACCTTAT 


TCACAATAGA 


AGGTAAGCTT 


CTTCGCCGCT 


T GAAAGT' AT T 


AGCTGGTCAT 


3660 


AGAGAAGT C A 


T GAAAGT G C A 


GACCGGAGGG 


GCCTATATTG 


TGCATCTTCA 


AAATGCTTTC 


3720 


ACTAATGATG 


TGCATAAGGT 


GCTTGTTGAG 


TAT 






3753 



15 



(2) INFORMATION FOR SEQ ID NO: 39 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1278 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

25 <ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 

30 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 
35 (A) NAME/KEY: misc_f eature 

(B) LOCATION 1...1278 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39 



ACTATGGAAG TGAAGAAAAA CACAGTGGTG 
5 CTTCTCTTCC ACTCCTCACG GCTGTGGGGA 
TTCAAAGGAT T C GT G GAT AC CTACCATGCC 
AGCTCGCGTA C GAGAG T GAG AGGTGAGCTG 
GTATCGGTCA ATGCCACCTA CAATGCTCTA 
GAAGCCTTCT T C GAG CAT C A GGAAGAGCAT 
10 ATTTGGGGGG CTGCCGACGG TGTGCGCATC 
GAGTTTCTGG CACAGGATTA C GAT GAT AT T 
GTCTTCAACG AATCGATGAA AGTGGAAGTC 
CTGCCTGTGG ATCCTCGCAA TCCTTGGAAT 
ATGAATATCG TCTGGAAAGA AGAAGCCGGC 
15 TACGGTGCGC GAT G GAG C AC TACGCTCTCC 
ACATGGAACA AGATGCCCGT CAT C GAAGTA 
AGCCCTCGCT AT TAT C GT AT GGGATTTGTC 
TTTGTTTTCA GGGGAGAGGC TGCGTTCAAT 
GCCGAGCAAG AGGGTTTCCA AAC AAT C AAT 
20 GGTGAATGGA TGATCTCAGG ACAATTCTCA 
ATCTCCCAAA G AC AAC AT T C TACCCTGATT 
AGTACACTCC AACTTTCGGA CTTCACCTAC 
CGCTTTGCAG CTGACTATGC CTTGAACGAT 
TTCAGTAGTA AGGGCAGCGG TATATTCGAT 
25 AAAGCCCGCT ACAGCTTC 



CTACGCCTTC TGATTTGGTT CGTGGCCATT 60 

CAGGAAGGGG AGGGGAGTGC CCGATACAGA 12 0 

GTACGCAGCT CTTCTCCTTT T GATT T CAT G 18 0 

GAGAGGTCGT TCGGTAATTC GAAAGTAGCC 24 0 

CTGAAAGACG AGACCGGCTT ACGTTTACGT 300 

TGGGGGTTGC GCCTCGGACG AC AGATT GT C 3 60 

ACGGATCTGA TCTCCCCGAT GGATATGACC 420 

CGTATGCCGG TCAATGCATT GCGTTTCTCT 4 80 

GTGGTACTGC CTGTATTCGA GGGGTACCGT 54 0 

ATCTTCTCCC TTTCGCCCAT TGCTCAGGGG 600 

AAACCGGCCT TCAAGGTTGC CAATATCGAG 6 60 

GGTATCGACT TCGCTTTGGC TGCATTGCAT 72 0 

CAGGGCATTG TGCCGACGGA AAT CAT C GTT 7 80 

GGCGGCGACC TCTCCGTACC CGTCGGACAG 84 0 

AT C G AC AAAC ACTTCACCTA T AAGAGT CAT 900 

TGGTTGGCCG GAGCCGATTG GTATGCTCCC 9 60 

ATGGAAAGCA TATTCAGGTA TAGGGATTTC 102 0 

ACTCTCAATG TTTCCAAGAA ATTCTTCGGC 108 0 

TACGACCTTA CGGGCAAAGG ATGGTTCAGT 114 0 

C AGAT AC AT C TGATGGCCGG ATATGACTGG 1200 

CGCTACAAAG ACAATTCCGA ACTCTGGTTC 12 60 

1278 



(2) INFORMATION FOR SEQ ID NO: 40 

30 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 756 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

35 

(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 
0 (A) NAME/ KEY : misc_f eature 

(B) LOCATION 1...756 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40 



15 



20 



25 



AGGGGGTCGA 


GTTCGGGCAT 


TTCAGCCCGT 


G GAC GC GAC A 


TGCGTTCTTT 


ATTTTTGAGC 


60 


GCGTTGCGCA 


GCTCCTCTCT 


CCATGGTTCA 


GAGCGACGCA 


GTCGGATAAG 


TTCTTCTGTA 


120 


GTCATGTCAA 


TAAGGCAGAA 


AATAAGGTTA 


TTCCATCTCT 


CGGTATGCGC 


C C AAAC G CAT 


180 


GAT CAT C T C A 


TCGAAATCCA 


CTTGGTGTGC 


ATCGAATTCG 


GGGCCATCGA 


CACAGACGAA 


240 


TTTCGTCTGT 


CCTCCCACGC 


TTATAC GACA 


AGCCCCACAC 


ATACCGGTGC 


CATCCACCAT 


300 


AATTGTATTG 


AGAGAAGCTA 


TGGTCGGTAT 


CTCGTAACGT 


TTGGTCAGGA 


GAGAAAC GAA 


360 


CTT CAT CATC 


ACAGCCGGCC 


CGATCGTAAC 


GCAGAGGTCT 


ACCGTTTCCC 


GTTTGATAAC 


420 


GCTTTCCACT 


CCATCCGTTA 


CGAGGCCTTT 


C GT C C CAT AA 


GACCCATCGT 


C T GT C AT GAT 


480 


GATCACTTCA 


TCGCTATTGG 


CTCGCATTTG 


TTCTTCAAGG 


ATAACCAGAT 


CTTTAGTTCT 


540 


GGCAGCCAAT 


AC GAC AATTA 


CACGGTTGCC 


TGCTTTGTGG 


AAAGCCTCCA 


CGATCGGGAG 


600 


CAAAGGAGCC 


ACACCCACAC 


CGCCTCCGGC 


ACAAACCACT 


GTGCCGACCT 


TTTCGATATG 


660 


CGTACTCTGT 


CCCAGCGGAC 


CTACCACATC 


C GT GAT AT AG 


TCGCCGACTT 


CGAGTTCGGC 


720 


CAATTTCTTG 


GAAGATTTGC 


CCACGGCCTG 


AACCAC 






756 



(2) INFORMATION FOR SEQ ID NO : 4 1 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 
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(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1. . . 798 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 

AT AAT T TATA GATCAACAAT GAAACTATTA CTTTATCTCC TATTGGTCTT GTCGACTCTA 
TCCCCGATGT ATTCGCAAAT GCTCTTCTCA GAG AAT C T C A CAATGAATAT AGACAGCACC 
AAAAC CATAC AAGGAAC GAT ATTGCCCGTA CTGGATTTCA AAACCGAAAA GGAAAATGTG 
TTCACCTTCA AAAATACTGC CAATCTCAAT CTGCTGATAA AGCACGGTCA AGTAAT CAAC 
TTAATTAATA AGCTTGAGTT TTCTACCTAT GGCAATAAAG TAACCGTAAG T GGAGGAT AT 
GTACACACCG AATACCGCTA TTTGTTGCAT CATGTTTTTG AGGTTTATCC TTATGTCGAG 
TCGCAATGGG CAGAAAGTAG AGGAATGAAA TATAAGGTTT CTACGGGATT ACAGTCGCGT 
TATCGGCTGG TAAATAGTGA TAACTGTCTC ATGTTTGCAA CATTGGGGGT- ATTTTTCGAA 
TTCGAAAAGT GGGAACAGCC AGCCACTAGC CTCTTTGCAG GAACGTATGC ATACAGCCGA 
AGTAT CAAAA GCCACCTGTC TATCAGTTTC AGACATCGGT TGGGTGAACA TTGGGAATTT 
ACAACTACGG CTATTCACCA GGGAAAGCCT GACAGTT AT T TTAAGAAGGC ACGTTTTGGA 
GGAGCTATCG ACCTCAAATA C CAT AT C AC A CCTACGATAG GAATACGCGG GGCCTATCGG 
AT CAT CTAC G ATACTGCCCC TATTGTACCT GTGCGGAAAG ATTACAACAC CGTTGATGTT 
GGT AT C GAT A TTTCGTTT 

(2) INFORMATION FOR SEQ ID NO: 42 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 987 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

5 (ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

10 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

FEATURE: 

(A) NAME/ KEY : niisc_f eature 

(B) LOCATION 1 . . . 987 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 



20 



25 



30 



35 



GAGACGAACT 


CTTGGGTATC 


: CAGCGATTGC 


: AATTCGACGA 


. CGATGAAAAC 


GAATAGACGA 


60 


TACGCATTTG 


TTTTGCCGCT 


TCTGCTACTC 


ACCGGATTGT 


TGGCATGGGG 


GCAGGATTCT 


120 


TCCCACGGTA 


GCAAT AC AG C 


GTTTGCAACT 


GATTCTTCGA 


GTAGAGAGTT 


GCCCACGGAG 


180 


CAGTCCGCCT 


ACCGCATTCA 


TTCTGCCTAT 


ATGGTCGGTG 


GTGGCGGAAG 


CATAACGCGC 


240 


GACAC CTATT 


TGTCACCCCT 


TCGTTATGGA 


G GAT GGACAC 


TGAATTTGTT- GG GAGAGAAG 


300 


ACGTTCCCTC 


TCAAAGCCTC 


CGATTCCCGT 


TGGATGATCC 


GTACCGGGCA 


TGAGCTGGAT 


360 


TTTGCCCTGA 


TGGACAATCC 


GGCCAATAAT 


GCTCATTTCT 


ATTCCCTGCT 


GTATAACGGT 


420 


TCCGCTGCGG 


CTCTTTACCG 


CCTTGGCGCT 


AAGCATCTGC 


GAGCCGCGTG 


GATGGACAAT 


480 


CTGCGCTTGG 


CATTCGGCCC 


GGGCTTGGAA 


ATCGGGCTTG 


GAGGAATTTA 


TAGTACACGC 


540 


AACGGCAATA 


ATCCTGCGAC 


ATTGAAGCTC 


T AC AC CAAT G 


CCATCGCCCA 


AGCCTCGATA 


600 


GGATACTACG 


TCCCCTCCGA 


AACTTTTCCC 


CTGTATTTTC 


GGTTGCTCTC 


CCAGATCAAT 


660 


CTCTTCGGTA 


TAG C CT AT GG 


AAATGGTTTT 


GGTGAGAGCT 


AT TAC GAGAA 


TTTTTTGCTC 


720 


AATAACGGCA 


TTGCAGGCTC 


CCTGCATTTC 


ACTTATCCGG 


GCAAGTTTAC 


TCGGTTCACG 


780 


ACACTCATAA 


C GG C G GAT AT 


TCCCATTCGG 


AACTTCTGTA 


CGCTTCGTGT 


CGGTTATCGC 


840 


TATTCCCATT 


TGGGCTCTTC 


GCTTAACGCA 


TTGGATACTC 


GAAT C C AC AG 


TCATACGGCT 


900 


TTTATCGGTT 


TCGTCACGGA 


GTTTTACCGA 


TTCCGTGGGC 


GCAAAGCCAT 


GAATACCGGT 


960 


CGGAGAACCA 1 


GTCTTTACTA 


T CAT GAT 








987 



(ix) 



15 



78 



10 



(2) INFORMATION FOR SEQ ID NO: 43 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2442 base pairs 

(B) TYPE: nucleic acid 

{ C) STRANDEDNESS : double 
(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

( iii ) HYPOTHETICAL : NO 



15 (iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

20 (ix) FEATURE: 

(A) NAME/ KEY: misc_f eature 

(B) LOCATION 1...2442 



25 



30 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 



CCCCGTTTTC 


ATCCCGGAGG 


GACACACGCT 


CAGCTTTGCA 


GAAAT GG GAG 


AAGAAACCAA 


60 


AAATCAAATC 


AGTCATCGTG 


CTTTGGCGGT 


GGCACAGCTC 


CGCGATTTTT 


TATTATGTGC 


120 


AAAATAAGAT 


TCAGCCTCTT 


GCAGGCTCTT 


GTCGTCTGCT 


TAT T GT T C AC 


CTCTTTTTCT 


180 


CTCCAAGCTC 


AAGAGGAAGG 


TATTTGGAAT 


ACCCTCTTGG 


CTATCCACAA 


GAC GGAAAAA 


240 


GCCGTAGAAA 


CGCCCAAGAA 


AGTCTTTGCC 


GTAGCCAACG 


GAGTACTTTA 


CTCGGTGGGC 


300 


AAAGAAGCTC 


C C CAT GAGGC 


AAAGATCTTC 


GACCGTATCA 


GCGGACTCAG 


C GAT AC AT C G 


360 


GTAAGCAGCA 


TAGCCTACTC 


CGAGCAACTA 


AAATCCTTGG 


TCATATACTA 


TGCATCAGGC 


420 


AAT AT C G AC A 


TCTTGGACGA 


AGCAGGCCGT 


GTGACCAACG 


TACCTGCATT 


GAAAGACAAT 


480 


ATCGATCTGA 


TAGACAAAAC 


GCTCAATCGC 


CTTTTGATCG 


TAGGCAACAG 


GGCTTATTTG 


540 


GCAGGAGGAT 


TCGGCCTCTC 


CGTTCTGGAT 


GTCGCCGAAG 


CTCGCATACC 


GGCTACCTAC 


600 


GCCAAGGGAA 


CTAAGGTGAC 


CGATGTGGCT 


AAGTTGGACA 


ATGATCGCTT 


GCTGATGCTG 


660 
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AAAGAAGGGC AGCTCTTCAT CGGAAAAGAG AC C GAT AAC C TGCAAGATCC GGCCGCATGG 720 

ACAGCCTTGT CTTTGAATTT GCCGATGGGC TCGGTCACCG GTCTGGGCAT TGTCGGGGAA 7 80 

GACATCTGTT TCCTGCTCGC CGATGGCCGT GT AT AT GT C G CTGCAAACCA ATCGTTTGAG 8 40 

CCGGAGCTAT TGCTCTCTTC CTCCGCCGAT TCACGACTGT ATGTGACGGA TCGTGGTCTG 900 

5 TTCATCTGTG CCGAGAATCG AATTTATTTC ATAGAAAAAG GTCGCAAAAC GACACAATTT 960 

CCTATAGCCG ACGTCCTTGG TGTCGGTGCC AT GAACGAAA GCAATACGGC ATACATAGCA 1020 

TTGGGAGAAG AAGGTTTGGC TTCACTTCTT CTCGCAGAGG GAAGTACGGC CGAAGCCATG 1080 

CCTGTAGCAT TCGACGGACC GGGGGACAAT GATTTCTACG AGATGCGGTT TAGTCACGGA 114 0 

CGTCTGTATG CAGCCAGCGG ACTCTGGGGA ACAAACCTGA TGGGACATGC CGGTATGGTG 12 00 

10 AAGCTATACG ACGGCAACCG AT GGACT AAC TTCGACAAGA AGACC GTAC A GGAACAGTTG 12 60 

GGCGGCGGAT TCAGTTTCAA TGATGCTATC GAT AT AGCT G TTTCCAACGG AGACCCCGAT 132 0 

CACTTTTTTG T C GGTAC AT G GGGAAACGGT CTGTTCGAAT TCAAGGATGG CAAAGC GAT A 13 8 0 

GCTCGCTATT CGGGAAACGA AACTGCTATC GCAGAAT GTA ATCCCGGAGA TGCCCGTGTG 14 4 0 

AAAGCGATTG CCTTTGACAA TAAGGGCAAC CTCTGGGGGA CGCTCGGTGC CGTAGGCAAG 1500 

15 AACATCTTCA TGTACGATCC G C AG AG TAG C ACATGGCATT CTTTCAGCTA TCCGGATGTA 15 60 

GCCAATCTGG CCTCCTTCGG CAATATGATT ATCCTACCCA ACGGAGACAA ATGGGTAAAT 1620 

ATCCTTCACC GTAGTGGCGG ATCCACGCGC AAAGGTGTCT TGATCTTCAA CGATCGGGGT 1680 

ACACCGGAAA CGACTTCGGA C GAC AG C CAT CTTTACGTCG AGCAGTTTGT CAATCGCCTC 17 4 0 

GGGGCAGCCA TAG GAC AT AA GACTAT C TAT GCAATGGCCG TCGATCATAA CGGCTCTGTC 18 00 

20 TGGATGGGAT CGGATATAGG CATTTTCGGC GTCTACAATG CAGCCGGAGT ATTGTCCTCG 18 60 

ACTTCTACCC CTATCGCTGT TCGGCCGGTC GGAGGAGAAG AACCCAATTT GTACTATGTG 1920 

CTGGACAAGG TGACGGTGAC AGACATCGTC GTGGACAAAC TCAATCACAA ATGGGTTGCC 1980 

ACCCAAGGGA CAGGACTCTA TCTCCTTTCG GAAGATTGCA GTAAGATCCT CGCGCAATTT 2 04 0 

AC C GT AGAAA ACAGCCCTTT GCTTTCTAAC AACATACTAT CCCTGGCCTT- AAAT GAC GAT 2100 

25 AACGGACTGC TGTACATCGG TACGGCGGAC G G ACT GAT G A CGTTCCAAAC GGGTACGGGG 2160 

AGTGGATCAG CTTCCGAACT GGACGGCGTC TAT GT AT AC C CCAATCCGCT AAGGCCGGAA 2220 

TATCCCGATG GCGTCACCAT TGCCGGACTG CAAGCCGGCT GTAGTGTCAA AATCACCGAT 22 80 

ACCACCGGCA GACTGCTATA C CAGACT GAG AGCGTAACCA CCGAAGTCAA ATGGAATGCT 234 0 

CGAGGTGCCG ATGGCAATAG GGTAGCTTCG GGCGTATATG CCGTTGCAGT GTACGATCCG 24 00 

30 GT AT C GAAAA AGT CCAAACT AATTCGCTTC GCAGTGATTC GC 2442 

(2) INFORMATION FOR SEQ ID NO: 44 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1353 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

5 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

10 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Forphyroraonas gingivalis 



(ix) FEATURE : 
15 (A) NAME/ KEY : mis c_f eature 

(B) LOCATION 1 . . . 1353 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 

20 AAAAC AT CAT ATCGGAATAA TATGCGTTTC CAACATTATC TCATCTGTAC GGCTGCCGTA 60 

GCGGCTTTGG CTGCGAATCC CCTTACGGGC CAATCGAATA TGACCCTCGA AGAGT GCATA 12 0 

GACTATGCAC GCCGGCACAG TTCGGCCGTG GCGCTGTCCG CTGCGGAACT GGAGCAGTCC 18 0 

AAGGCCGATT ACCTTCAGGC CGTCGGCAAT TTTCTGCCCC GTGTATCGGC CGGAACCGGT 24 0 

GCTTCGTGGA ATTTCGGACG CGGATTGGAT GCCGAGACGA AT AC CT AC AG- C G AC AT C AAC 300 

25 AGCTTCAACA ATT C GTAC AG CATACATGCC ACGATGACCC TTTTCGACGG TTTGCAGAGT 360 

GTCTATCGAC TGCGGATGGC G CAT GCAC G C CGGGAGGCTT CGCGCCTCTC CGTTCGCGA.G 42 0 

CAGCAGGAGC TGGCAGCTCT CGGCACCACG GAGGCCTACT ACGACCTCGT CTATGCGCGC 480 

CAAATGCAAG AGCTGGCCAT GCAGAAGTAC GAGGAGAGCA GCCGCCTCCA CCGGCAGACG 54 0 

GCTCGAATGG AAGAGCTGGG GATGAAGAGT CGTCCCGATG TCCTCGAGAT GCAGTCGCGA 600 

30 ATGGCCGGTG ACCGTTTGGC CCTGACTCAA GCGGACAATC AGT GC AT CAT CGCTCTGATC 660 

CGGCTCAAAG AAAAAATGAA CTTCCCCATC GATGACGAAC TCGTCGTAGA C GAT AT GC C G 72 0 

GCTGACAGTC TCTCCGCCGA CATGGCCGAA TCGGACAGCT CGGCCGGCGT CTTCGCCCGT 7 80 

GCTGCCCATC ATCATCCCGT CCTCCTCCGT GCCAAACTCG ACGAGCAGGC TGCCACCGAC 84 0 

CGTTTGCGAG CCGCGCGAGG TGCATTCCTG CCGAGTGTGT CGGTATCCGG AGGAT GGAAC 900 

35 ACGGGATTCT CACGCTTTTT GAATGGATCG GACTATACGC CCTTCAGCGA GCAGTTTCGG 960 

AACCGTCGGG GGGAATACGT CAGTCTGAAT CTGAGTATCC CCATCTTTTC GGGATTCAGC 1020 



81 



5 



CTTGTGAGCC 


ATCTGCGTCA 


GGCGCGTGCC 


GAACGCAGGG 


CGGCAATCGT 


CCGACGGGGC 


1080 


GAAGCGGAGC 


GCAGGCTCTA 


CAGCGAGATC 


GCCCAAGCCA 


TGGCCGACCG 


GGATGCCGCT 


1140 


CTGGCTTCCT 


ACCGCCAGGC 


GAAGGAGCAT 


ACCGACGCCA 


TGCAAACCGC 


TTACGAAGCC 


1200 


GTCTTGCAGC 


GTTAT GAG GA 


GGGGCTGAAT 


ACGGCCATCG 


ACCTGACCAC 


TCAGGCCAAT 


1260 


CGGCTCCTGG 


ATGCCCGTGT 


GCAGCGACTG 


AGAGCGGCCA 


TGACCTACCG 


GCTCAAATGC 


1320 


AAACTCATAG 


CCTATTACGG 


CTGCCTTTCG 


GAC 






1353 



10 (2) INFORMATION FOR SEQ ID NO: 45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 589 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

20 



25 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1 . . . 589 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45 

30 Asp Tyr Gin He His Glu Gly Ala Tyr He He Pro Val Lys Glu Gin 
1 5 10 15 

Met Arg Pro Leu Phe Asn Gly Asn Asn Asn Phe Ser He Tyr Val Val 

20 25 30 

Ala Asn Leu Asp Phe Asn Ala Pro Ala Thr Glu Ala Ala Leu Ser Gin 
35 35 40 45 

Phe Val Val Glu Lys Ser He Glu Val Ser Ser Thr Thr Ala Pro Ala 
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50 55 60 

Asp Phe Val Met Leu Ala His Gly Asn Lys Gin lie Asn Met Ala Thr 
65 70 75 80 

Thr Glu Gly Lys Leu Leu Gly Asp Tyr Lys Leu Lys Arg Val Ala Ala 
5 85 90 95 

Lys He Arg Met He Lys Pro Thr He Asn Val Gin Gly Tyr Glu Val 

100 105 110 

Val Gly Asn He Gin Ala Lys Phe Arg Asn Ser Val Thr Lys Gly Phe 
115 120 125 

10 Leu Thr Thr Glu Ala Gin Glu He Pro Ala Ala Ala Ser Tyr Lys Thr 
130 135 140 

Ser Glu Tyr Leu Asp He Ala Glu Ser Ala Pro Ala Asn Ser He His 
145 150 155 160 

Phe Tyr Ser Tyr Tyr Asn Lys Trp Thr Leu Ser Thr Pro Glu Lys Arg 
15 165 170 175 

Pro Glu Phe Phe He Met Val Lys Phe Lys Lys Thr Gly Gin Pro Asp 

180 185 190 

Asn Thr Ala Lys Pro Tyr Tyr Tyr Arg Val Pro Leu Glu Ser Gin Asp 
195 200 205 

20 Asn Gin Val Lys Ser Asn Val Leu Tyr Asn Leu Asn Val Lys He Glu 
210 215 220 

He Leu Gly Ser Leu Gin Glu Pro Glu Ala Val Ser Val Asn Gly Thr 
225 230 235 240 

Leu Ala He Glu Glu Trp He Leu His Gin Asp Ala Phe Asn Leu Pro 
25 245 250 255 

Ala Thr Asn Tyr Leu He Val Glu Gin His Glu He Phe Met Asn Asn 

260 265 270 

Val Asn Thr Tyr Ser Val Lys Tyr Gin Thr Ser Gin Lys Pro He Ser 
275 280 285 

30 He Ser He Gin Ser Val Thr Phe Ser Tyr Val Ser Ser Asp Gly Thr 
290 295 300 

Gin His Asn Asp Leu Val Ala Ser Ser Ser Asp Gin Tyr Pro Thr He 
305 310 315 320 

Thr Ser Asp Asn Thr Ser He He He Thr Ser Lys He Pro Val Asn 
35 325 330 335 

Asn Val Pro Lys Lys He Val Phe Glu Val Thr Asn Gly Val Ala Gly 



4 
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340 345 350 

Leu Lys Glu Thr Val Thr Val Leu Gin Tyr Pro Ala Gin Phe He Val 

355 360 365 

Asn Thr Leu Gly Thr Ala Ser Ala Trp Arg Pro Asp Gly Ser Leu Ala 

370 375 3 8 o 

Pro Gly Leu Asn Asn Lys Ala He Tyr His Val Val Val Leu Val Pro 
385 390 395 4 00 

Pro Glu Asn Leu Phe Glu Asp Gly Thr Gin Thr He He Gly Tyr Pro 

405 410 415 

Pro Thr Glu Thr He Ser Phe His Lys Lys Glu Asn Asn Thr Tyr Pro 

420 425 43 0 

He Val Trp Ser Asp Thr Asn Thr Thr Lys Gin Asp Leu Glu Thr Ser 

435 440 445 

Arg Met He Ser Pro Ser Phe Glu Leu Ala Ser Gin Leu Gly Ala Thr 

450 455 460 

Leu Pro Met Pro Tyr Leu Glu Tyr Trp Pro Gly Thr Ser Tyr Leu Leu 
465 470 475 480 

Asp Tyr Ser Gly Asn Tyr Asn Asn Lys Arg Tyr Ala Leu Phe Asn Cys 

48 5 490 495 

Ala Phe Tyr Trp Glu Lys Arg Lys Val Asn Asn Glu Glu He Lys Phe 

500 505 510 

Asp Asp Trp Arg Leu Pro Thr Glu Ala Glu He Lys Leu He Asp Lys 

515 520 525 

Leu Gin His Asn Glu Gin Ser Ala Val Gin Ala He Met Th,r Gly Asn 

53 ° 535 540 

Tyr Tyr Trp Asp Ser Tyr Ser Ala Asn Gly Ser Tyr Lys Met Gin Gly 
545 "0 555 560 

Gly Gly Gly Gin Gly Asn Ser Ser Lys Ala Tyr Val Arg Cys Val Arg 

565 570 575 

Asp Val Lys Lys Pro He Arg Asp Lys Lys Ser Gly Lys 
580 5 8 5 



(2) INFORMATION FOR SEQ ID NO: 46 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 907 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1 . . . 907 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 6 

Thr lie Cys Val Ala Phe Leu Ser Ala Pro Val Ala Ala Leu Arg Glu 

15 10 15 

Ser Pro Pro Met Gly Ala Glu Arg Lys Thr Pro Ser Leu Leu Pro Leu 

20 25 30 

Leu Phe Glu His Ser Asp Lys Gly Lys Gly Phe Asp Tyr Arg Leu Phe 

35 40 45 

Thr Ser Asn Lys Leu Lys Val Phe Ala Thr Gly Asn Ser Arg Tyr lie 

50 55 60 

His Asn Lys Pro Thr lie lie Gin Ala Met Lys Arg lie Val Leu Ser 
65 70 75 80 

Ser Phe Leu Phe Val Leu Ser lie Leu Ser Leu Met Ala Gin Asn Asn 

85 90 95 

Thr Leu Asp Val His lie Ser Gly Thr lie Lys Asp Ala Ser Ser Gly 

100 105 HO 

Glu Pro Val Pro Tyr Ala Thr Val Ser lie Arg Leu Thr Gly Ala Asp 

115 120 125 

Thr Thr Gin Val Phe Arg Gin Val Thr Asp Gly Asn Gly Tyr Phe Val 

130 135 140 

lie Gly Leu Pro Ala Ala Pro Ser Tyr His Leu Thr Ala Ser Phe Val 
145 150 155 160 
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180 185 



5 Ser Thr Val Thr Val Ser Ala Ala Arg Pro Leu Val Lys Met Glu 



195 200 



Gly Met Lys Thr His Thr Met Gin He Ser Arg Gly Asn Gly Gin His 

165 170 175 

Asp He Lys Ser He Asp lie Ser Leu Glu Ser Glu Asp Lys Gin Leu 

190 

He 

205 

Asp Arg Leu Ser Tyr Asn Met Lys Asp Asp Pro Ala Ala Lys Thr Asn 

210 215 220 

Asn Leu Leu Glu Met Leu Arg Asn Val Pro Leu Val Thr Val Asp Gly 
10 22 5 230 

235 240 
Gin Gly Asn He Gin Val Lys Gly Ser Ser Asn Phe Lys Il e His Leu 

245 250 255 

Asn Gly Arg Pro Ser Thr Met Val Ser Ser Asn Pro Lys Glu Val Phe 

270 

e Thr Asp 

285 

Pro Gly Val Lys Tyr Asp Ala Glu Gly Thr Ser Ala lie Leu Asp He 

290 295 3 00 

Val Thr Glu Glu Gly Lys Lys Leu Glu Gly Tyr Ser Gly Ser He Thr 
20 305 310 315 

320 



260 265 



15 Arg Ser Ile Pro Ala His Thr He Lys Arg Val Glu Val II 
275 280 



Ala Ser Val Ser Asn Asn Pro Thr Ala Asn Gly Ser Ile Phe Leu Thr 

325 330 335 

Ala Lys Ser Gly Lys Val Gly Leu Thr Thr Asn Tyr Asn Tyr Tyr Gly 



340 345 



. 3 SO 

25 Gly Lys Asn Lys Gly Ser 



30 385 



Arg Tyr Phe Thr Glu Arg Thr Thr Ser Met 

365 

Leu Gin Thr He Glu Glu Gly Lys Gly Gin Glu Thr Phe Gly Gly His 

370 375 330 

Phe Gly Asn Ala Leu Leu Ser Phe Glu lie Asp Ser Leu Asn Leu Phe 



355 36o 



390 



395 4 0 o 



Thr Val Gly Gly Asn Val Arg Leu Trp Glu Met Thr Thr Asp Arg Asn 

405 410 415 

Ser Val Glu Lys Ser Phe Ala Gly Ser Asn Leu Met Ser Tyr Ile Asp 
420 425 430 

35 Arg Lys Leu Lys Thr Gin Met Asp Ala Gly Ser Tyr Glu Leu Asn Ala 



435 44 0 



445 
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Asp Tyr Gin His Ser Thr Arg Leu Pro Gly Glu Leu Leu Thr Val Ser 

450 455 460 

Tyr Arg Phe Thr His Asn Pro Asn Asn Ser Glu Thr Phe lie Asp Gin 
465 470 475 480 

5 Trp Lys Arg Asp Pro Leu Asn Thr Ala Asn Thr lie Gin Tyr Ala Gly 

485 490 495 

Gin His Ser Lys Ser Asp Ala Gly Met Asp Glu His Thr Ala Gin Val 

500 505 510 

Asp Tyr Thr Arg Pro Leu Gly Gin Ala His Ser Leu Glu Ala Gly Leu 
10 515 520 525 

Lys Tyr lie Tyr Arg His Ala Thr Ser Asp Pro Leu Tyr Glu lie Arg 

530 535 540 

Pro Ser Glu Asp Ala Pro Trp Gin Pro Gly Ser Leu Tyr Ala Gin Asn 
545 550 555 560 

15 Pro Ser Asn Gly Lys Phe Arg His Asp Gin Tyr lie Gly Ala Ala Tyr 

565 570 575 

Ala Gly Tyr Asn Tyr Arg Lys Asp Gin Tyr Ser Leu Gin Thr Gly Leu 

580 585 590 

Arg Val Glu Ser Ser Arg Leu Lys Ala Leu Phe Pro Glu Asn Ala Ala 
20 595 600 605 

Ala Asp Phe Ser His Asn Ser Phe Asp Trp Val Pro Gin Leu Thr Leu 

610 615 620 

Gly Tyr Thr Pro Ser Pro Met Lys Gin Leu Lys Leu Ala Tyr Asn Phe 
625 630 635 - 640 

25 Arg lie Gin Arg Pro Ala lie Gly Gin Leu Asn Pro Tyr Arg Leu Gin 

645 650 655 

Thr Asn Asp Unk Gin Val Gin Tyr Gly Asn Pro Asp Leu Lys Ser Glu 

660 665 670 

Lys Arg His His Val Gly Leu Ser Tyr Asn Gin Tyr Gly Ala Lys Val 
30 675 680 685 

Met Leu Thr Ala Ser Leu Asp Tyr Asp Phe Cys Asn Asn Ala lie Gin 

690 695 700 

Asn Tyr Thr Phe Ser Asp Pro Ala Asn Pro Asn Leu Phe His Gin Thr 
705 710 715 720 

35 Tyr Gly Asn He Gly Arg Glu His Ser Phe Ser Leu Asn Thr Tyr Ala 

725 730 735 
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Met Tyr Thr Pro Ala 
740 

Asp Arg Thr Phe Gin 
755 

Trp Ser Gly Met Val 
770 

Trp Thr Val Asn Leu 
785 

Gin Thr Lys Tyr Asp 
805 

Gin Leu Phe Asp Lys 
820 

His Ala Lys Tyr Ser 
835 

Thr lie Tyr Ser Glu 
850 

Leu Thr Tyr Ser Phe 
865 

Arg Thr lie Val Asn 
885 

Gin Gly Gly Gly Gin 
900 



Val Trp Val Arg He Met 
745 

Lys Ser Glu Ala Leu Gly 
760 

Tyr Ser Gly Leu Met Phe 
775 

Phe Gly Gly Tyr Tyr His 
790 795 
Gly Asn Val Phe Asn Asn 
810 

Lys Leu Arg Val Ser Leu 
825 

Thr Trp Lys Ser Arg Thr 
840 

Asn Ala Gly He Gin Arg 
855 

Gly Lys Met Asn Thr Gin 
870 875 
Asp Asp Leu Lys Gin Thr 
890 

Gly Asn Pro Thr Gly Asn 
905 



Leu Asn Gly Asn He 
750 

lie Asp Val Asn Ser 
765 

Thr Leu Pro Lys Asp 
780 

Gly Gly Arg Ser Tyr 
800 

He Gly He Ala Lys 
815 

Ser Ala Asn Asn He 
830 

He Gly Asn Gly Phe 
845 

Ser Val Ser Leu Ser 
860 

Val Arg Lys Val Glu 
880 

Ser Ser Gin Gly Gin 
895 



(2) INFORMATION FOR SEQ ID NO: 47 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 606 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 
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(ix) FEATURE: 

(A) NAME/ KEY : mis c_f eature 

(B) LOCATION 1...606 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47 

lie lie Phe Cys Thr lie His His Ser Glu Leu Glu lie Met Asn Ser 
15 10 15 

10 lie Met Lys Tyr Gin Leu Tyr Thr Ala Val He Met Ala Leu Ser Val 
20 25 30 

Ser Ser Val Cys Gly Gin Thr Pro Arg Asn Thr Glu Thr Lys Arg Pro 

35 40 45 

Asp Thr Leu Arg Arg Glu Leu Thr He Val Asn Asp Gin Thr Val Glu 
15 50 55 60 

Met Glu His Ala Asp Pro Leu Pro Ala Ala Tyr Lys Ala He Glu Pro 
65 70 75 80 

Arg Leu Lys Pro Phe Arg Pro Glu Tyr Asn Lys Arg Thr Phe Gly Phe 
85 90 95 

20 Val Pro Glu Val Ser Ser Ser Gly Arg Asn Asn Leu Pro Asn He Leu 
100 105 HO 

Pro Thr Glu Gly His Met Lys His Arg Gly Tyr Leu Asn He Gly He 

115 120 125 

Gly His Thr Leu Asn Gin Arg Met Asp Ala Gly Tyr Arg Leu He Asp 
25 130 135 140 

Ala Glu Gin Glu Arg Leu Asn Leu Phe Leu Ser Tyr Arg Gly Met Lys 
145 150 155 160 

Ser Ala Phe Asn Thr Gly Asp Phe Asp Gly Asp Arg Lys Asp Arg Arg 
165 170 175 

30 Met Met Ala Gly Val Asp Tyr Glu Gin Arg Arg Pro Ser Phe Val Leu 
180 185 190 

Ala Thr Gly Leu Tyr Tyr Ser Asn His Tyr Phe Asn Asn Tyr Gly Arg 

195 200 205 

Gly Ala Thr Thr Asn Val Gly Ser He Pro Gin Leu Ser Thr Pro Val 
35 210 215 220 

Thr Pro Gin Met Asp Asn Gly Thr His Asn Val Arg Val Tyr Leu Gly 
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225 230 2 35 240 

Ala Lys Asn Asp Val He Asp Ala Arg He Asp Tyr Arg Phe Phe Arg 

245 250 255 

Ser He Pro Tyr Leu Gly Thr Asp Pro Met Lys Ala Leu Thr Glu His 

260 265 270 

Thr Pro Glu Leu Asn Val Thr Met Ser Asn Glu Leu Ser Asp Asp He 

275 280 285 

Lys Leu Gly Val Glu Val Arg Thr Gly Gly Leu Phe Phe Ala Lys Asn 

290 295 300 

Ser Glu Met He Gin Thr Gly Val Leu Ser Glu Thr Asp Arg Asn Leu 
305 310 315 320 

Tyr Tyr Val Glu Gly Ala Pro Thr He Gly Phe Val Gly Asp Ser Asp 

325 330 335 

Asn Met Gin Trp Asn He Gin Ala Gly Val Gly He Ser Ser His Phe 

340 345 350 

Gly Ala Lys Gly Arg Leu Phe Phe Trp Pro Lys Leu Asp Ala Ser Leu 

355 360 365 

Ser He Phe Pro Ser Trp Arg Val Tyr Ala Lys Ala Phe Gly Gly Val 

370 375 380 

He Arg Asn Gly Leu Ala Asp Val Met Gin Glu Glu Met Pro Tyr Leu 
385 390 395 400 " 

Met Pro Asn Thr He Val Leu Pro Ser Arg Asn Ala Leu Thr Ala Gin 

405 410 415 

Leu Gly Val Lys Gly Asn He Ala Asp Val Val Arg Met GLu Val Tyr 

420 425 430 

Gly Asp Phe Ser Lys Leu Thr Gly Val Pro Phe Tyr Thr Pro Thr Leu 

^35 440 445 

Pro Leu Tyr Asn Pro Ser Asp Leu Tyr Gin Tyr Asn Val Ser Phe Leu 

450 455 460 

Pro He Tyr Ala Asp Gly Ser Arg Trp Arg Ala Gly Gly Lys Leu Glu 
465 4 ?0 475 480 

Tyr Ser Tyr Arg Asp Met Leu Arg Phe Leu Val Asp Ala Ser Tyr Gly 

485 490 495 

Lys Trp Asn Leu Asp Gly Gly Leu Val Ala Ser Met Gin Pro Asp Leu 

500 505 5io 

He Leu Lys Ala Glu Val Gly Val His Pro He Ala Pro Leu Asp Val 
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515 520 525 

Arg Leu Arg Tyr Thr Gin Leu Asn Gly Arg Tyr Arg Tyr Ser Phe Gly 

530 535 540 

Ser Ala Gly Ser Glu Ala Leu Gly lie Gly Asn Val His Leu Leu Ser 
5 545 550 555 560 

Ala Asp Val Ser Tyr Lys Leu Lys Lys Asn Leu Ser Leu Tyr Leu Lys 

565 570 575 

lie Asp Asn Met Leu Ala Glu Thr Thr Glu Leu lie Gly Tyr Tyr Pro 
580 585 590 

10 Met Gin Pro Phe His Cys Phe Ala Gly Phe Ser Trp Thr Phe 
595 600 605 

(2) INFORMATION FOR SEQ ID NO: 48 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 357 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1 . . . 357 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48 



Asn Ser His Ala Thr Val Ser Val lie Cys Ser Met Met Glu Lys Cys 
15 10 15 

35 lie Phe Ala His Tyr Pro His Asn Leu Val Phe Met lie Arg Lys His 
20 25 30 
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Phe Gly lie He Leu Gly Phe Leu Ser Leu Val Phe Ser Ala Gly Ala 

35 40 45 

Gin Gin Glu Lys Gin Val Phe His Phe Leu Asn Leu Pro Ala Thr Ala 

50 55 60 

Gin Ala Leu Ala Ala Gly Gly Lys Ala He Thr He Val Asp Asp Asn 
65 70 75 80 

Pro Gly Leu Ala Phe Glu Asn Pro Ala Leu Leu Gly Tyr Glu Ser Gly 

85 90 95 

Gly Arg Ala Phe Leu Ser Tyr Leu Tyr Tyr Met Ser Gly Ser His Met 

100 105 no 

Gly Asn Ala Cys Tyr Ala Ser Ser Val Gly Glu Arg Gly Met Trp Gly 

115 120 125 

Val Gly Met Arg Phe Leu Asn Tyr Gly Ser Met Gin Gly Tyr Asp Gin 

130 135 140 

Asn Ala He Ala Thr Gly Ser Phe Ser Ala Ser Asp He Ala Val Gin 
145 ISO 155 160 

Gly Phe Tyr Ser His Glu Leu Ser Asn His Phe Arg Gly Gly Val Ser 

165 170 175 

Leu Lys Ala Leu Tyr Ser Ser He Glu Thr Tyr Ser Ser Phe Gly Leu 

180 185 190 

Gly Val Asp Val Gly He Ser Tyr Tyr Asp Asp Asp Lys Gly Tyr Ser 

I 95 200 205 

Ala Ser Ala Leu Phe Lys Asn Val Gly Ala Gin Leu Lys Gly Tyr Asn 

21° 215 220 

Glu Glu Arg Glu Pro Leu Asp Trp Asp Phe Gin Leu Gly Phe Ser Arg 
225 230 235 240 

Ser Phe He Asn Ala Pro Phe Arg Leu His He Thr Leu Phe Asn Leu 

245 250 255 

Asn Pro His Tyr Phe Lys Arg Leu Val Pro Arg Asp Leu Ser Lys Met 

260 265 270 

Gin Lys Phe Leu Arg His Phe Ser He Gly Ala Glu Phe Thr Pro Ser 

275 280 285 

Glu Arg Phe Trp Val Gly Leu Gly Tyr Thr Pro Gin He Ala Gin Asp 

2 ^0 295 300 

Phe Glu Val Glu Gly Gly Asn Lys Trp Gly Gly Leu Ser Ala Gly Val 
305 310 315 320 
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Gly Phe Thr Ser Gly Val Val Arg Val Gly Val Ser Ala Ala Thr Tyr 

325 330 335 

His Pro Ala Ala Leu. Ser Phe Met Cys Ser Val Gly He Arg Leu Asp 

340 345 350 

Asp Lys Ser He Phe 
355 



(2) INFORMATION FOR SEQ ID NO: 49 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 231 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME /KEY: misc_f eature 

(B) LOCATION 1...231 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49 

Lys Phe Met Met Lys Lys Ala Phe Val Phe Val Leu Leu Val Cys Leu 

15 10 I 5 

Phe Ser Ser Phe Ser Ser Ser Ala Gin Thr Thr Thr Asn Ser Ser Arg 

20 25 30 

Ser Tyr Phe Thr Gly Arg He Glu Lys Val Ser Leu Asn Leu Gly Val 

35 40 45 

Pro Pro Val Ser Thr Glu Val Trp Gly Met Thr His Asp Ala Asn Gly 

50 55 60 

Leu Pro Phe Glu He Pro He Ser Phe Ser Arg Phe Asn Ser Gin Gly 
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65 

Asp lie Ala Thr Thr 
85 

Glu Trp Cys Asp Tyr 
100 

Arg Phe Trp Lys Met 
115 

Thr Arg lie Thr Phe 
130 

lie Pro Lys Pro Lys 
145 

Cys lie Arg Thr Glu 
165 

Asp Thr Phe Val Ser 
180 

Pro Phe Leu Leu Asn 
195 

Lys Tyr Ala Val Gly 
210 

Gin Thr Val Ser Gin 
225 



70 

Tyr Tyr lie Ala Asn 
90 

Ala His Pro Gly Gly 
105 

Thr Tyr Asn lie Pro 
120 

Glu Asn Gin Glu lie 
135 

Val Ser Leu Pro His 
150 

Ala Gly Arg Glu Phe 
170 

His Asp Gly Asn Glu 
185 

Thr Asn Val Lys lie 
200 

Val Gly Glu lie Arg 

215 
Gin Lys 
230 



75 80 
Ser Glu Ala Thr Leu Asn 
95 

lie Val Arg Val Glu Gly 
110 

Thr Tyr Asn Ala Val Cys 
125 

Glu Gly Thr lie Val Leu 
140 

Val Ser Glu Ser Val Pro 
155 160 
lie Leu Cys Glu Glu Asp 
175 

Val Thr lie Gly Gly Lys 
190 

Val Gly Asp Val Ser Gin 
205 

Phe Leu Gin lie Cys Ala 
220 



(2) INFORMATION FOR SEQ ID NO: 50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 566 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 
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(ix) FEATURE: 

(A) NAME/KEY: mis c_f ea ture 

(B) LOCATION 1 . . . 566 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50 

Tyr Asp Gly Ala Arg Leu Val Tyr Thr Leu Phe Arg Asn Arg Asn Asp 
15 10 15 

He His Pro Met Lys Thr He Val Arg Tyr Ser Arg Leu Pro Val Ala 
10 20 25 30 

Leu Phe Phe Cys Leu Leu Gly Ala Val His Leu Ser Val Glu Ala Gin 

35 40 45 

Met Leu Asn Thr Pro Phe Glu Leu Ser Asp Gin He Val Leu Ser Pro 
50 55 60 

15 Thr Glu Arg Gin Tyr Arg Glu He Cys Val Gin Thr Lys Glu Lys Arg 
65 70 75 80 

Gly Ala Asp Leu Phe Pro Leu Ser Asp Lys Leu Arg Asp Ser Ala Tyr 

85 90 95 

Val Arg Phe Gly Ser Ala Tyr Gly Asp He Ala Gly Asp Tyr Leu Pro 
20 100 105 HO 

Tyr Asn Gly Asn Asn Tyr Ser Ser Leu Ser Leu Glu Ser Gly Gly Arg 

115 120 125 

He Ser Val Arg Asn Tyr Gly Thr Leu Gin Gly Ser Ala Ser Tyr Ser 
130 135 140 

25 Arg Gly Met His Lys Arg He Gly Trp Asn Ala Leu Arg Asn Ala Glu 
145 150 155 160 

Ala Tyr Tyr Pro Tyr Leu Val Ser Asp Ser Thr Gly Gly Asp Tyr His 

165 170 175 

Phe Glu Asp Tyr Arg Leu Ala Gly Tyr Tyr Ser Phe Arg Ala Gly Arg 
30 180 185 190 

Leu Pro Leu Gly He Gly Phe Ser Tyr Arg Gly Glu Val Ala Tyr Arg 

195 200 205 

Leu Thr Asp Pro Arg Thr Thr Asn Thr Thr Gly Ala Leu Glu Leu Ser 
210 215 220 

35 Cys Ala Thr Ser Leu Thr Leu Pro Arg Glu Asn Arg Leu Ser Leu Ser 
225 230 235 240 



95 



Ala Ala Tyr Leu Tyr His Arg Gin His Leu Thr Gin Tyr Asn Trp Arg 

245 250 255 

Pro Gly Gin Gin Asp Lys Phe Phe Val Ser Tyr Gly Phe Gly Gin Val 
260 265 270 

5 Asp Val Ser Asn Ser Pro lie Trp Phe Gly lie Ser Arg Met Asn Tyr 
275 280 285 

Val Asn Gly Trp Lys Leu Ser Ser Arg Leu Asp Thr Arg Arg Gly Asp 

290 295 300 

Ala lie Gly Leu Asp Tyr Ser Gly Tyr Phe Leu Asp Thr Glu Glu Arg 
10 305 310 315 320 

Ser Ser lie Asn Leu Phe Ala Leu Leu Tyr Asn Arg Leu Arg Leu Tyr 

325 330 335 

Gly Ser Trp His Leu Ser Asp Phe Asp Phe Ser Phe Ser Ala Asp Tyr 
340 345 350 

15 Ala Leu Arg Gin Gly lie Glu Arg lie Tyr Glu Asp Tyr Lys Pro Asp 
355 360 365 

Asp Asn Tyr His lie Tyr Asp Leu Arg lie Leu Ala lie Arg Arg Trp 

370 375 380 

Tyr Met Leu Asn Glu Phe Ser Ala Gin Ala Gin Ala Ser Tyr Arg lie 
20 385 390 395 400 

Arg Thr Asp Arg Gly Cys Ala Leu Arg Val Ser Ala Gly Ser Asp Phe 

405 410 415 

Tyr Gly Tyr Asp Glu Thr Tyr Arg Lys His Gly His His Thr Met Ser 
420 425 €30 

25 Gly Met Leu Arg Pro Phe Ala Gly lie Ala Tyr Asp His Ala Gly Ser 
435 440 445 

Lys Leu Asp Phe Gly Leu Ser Leu Ser Ala Ala Tyr Arg Met Val Leu 

450 455 460 

Thr His Ser Tyr Lys lie Arg Thr lie Gin Lys Glu Gin Leu Asp Tyr 
30 465 470 475 480 

Gin Leu Ala Tyr Leu Pro Tyr Ala Tyr Arg Asn Arg Glu Gly Val Glu 

485 490 495 

Val Arg Ser Ser Leu Tyr Val Ser lie Pro Met Gin Asn Thr His Arg 
500 505 510 

35 Leu Met Thr Glu Leu Arg Leu Tyr Gly Asp Leu Met Lys Arg Lys Asp 
515 520 525 
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Gly lie Ala Tyr Gly Lys Thr Pro Gly Val lie Ser His lie Leu Ser 

530 535 540 

Asp Pro Gin Ala Glu Arg Thr Ser Gly His Thr lie Gly Ala lie Cys 
545 550 555 560 

5 Asn lie Ser Tyr Leu Phe 

565 

(2) INFORMATION FOR SEQ ID NO: 51 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: mis c_f eature 

(B) LOCATION 1...819 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51 



Arg Thr Asn Val Phe Leu Ser Leu Ser His Lys lie Gly Arg Arg Gly 
15 10 15 

30 Ala Ser Cys Ser Asn Arg Asn Ala Trp Met Ala Glu Asn Lys Pro Ser 
20 25 30 

Ser Pro Glu Pro Asp Asn Thr Gly Val Gly Asn Ser Pro Ser Asp Tyr 

35 40 45 

Leu His Gly Glu Ala lie lie Pro Pro Leu Ser Ser Leu Ser Asn Phe 
35 50 55 60 

Asn Asp Lys Arg Phe Met Lys Lys Leu His Met lie Ala Ala Leu Ala 
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65 70 75 80 

Val Leu Pro Phe Cys Leu Thr Ala Gin Ala Pro Val Ser Asn Ser Glu 

85 90 95 

lie Asp Ser Leu Ser Asn Val Gin Leu Gin Thr Val Gin Val Val Ala 

100 105 no 

Thr Arg Ala Thr Ala Lys Thr Pro Val Ala Tyr Thr Asn Val Arg Lys 

115 120 125 

Ala Glu Leu Ser Lys Ser Asn Tyr Gly Arg Asp lie Pro Tyr Leu Leu 

130 135 140 

Met Leu Thr Pro Ser Val Val Ala Thr Ser Asp Ala Gly Thr Gly lie 
"5 150 155 160 

Gly Tyr Ser Gly Phe Arg Val Arg Gly Thr Asp Ala Asn Arg He Asn 

165 170 175 

He Thr Thr Asn Gly Val Pro Leu Asn Asp Ser Glu Ser Gin Ser Val 

180 185 190 

Phe Trp Val Asn Met Pro Asp Phe Ala Ser Ser He Glu Asp Leu Gin 

195 200 205 

Val Gin Arg Gly Val Gly Thr Ser Thr Asn Gly Ala Gly Ala Phe Gly 

21° 215 220 

Ala Ser Val Asn Met Arg Thr Asp Asn Leu Gly Leu Ala Pro Tyr Gly 
225 230 235 240 

Arg Val Asp Leu Ser Gly Gly Ser Phe Gly Thr Phe Arg Arg Ser Val 

245 250 255 

Lys Leu Gly Ser Gly Arg He Gly Arg His Trp Ala Val Asp Ala Arg 

260 265 270 

Leu Ser Lys He Gly Ser Asp Gly Tyr Val Asp Arg Gly Ser Val Asp 

275 280 285 

Leu Lys Ser Tyr Phe Ala Gin Val Gly Tyr Phe Gly Ser Asn Thr Ala 

290 295 300 

Leu Arg Phe lie Thr Phe Gly Gly Lys Glu Val Thr Gly He Ala Trp 
305 310 315 320 

Asn Gly Leu Ser Lys Glu Asp Glu Ala Lys Tyr Gly Arg Arg Tyr Asn 

325 330 335 

Ser Ala Gly Leu Met Tyr Val Asp Ala Gin Gly Val Pro His Tyr Tyr 

3 40 345 350 

His Asn Thr Asp Asn Tyr Glu Gin Arg His Tyr His Ala He Met Thr 
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355 

His Ser Phe 
370 

Ala Gly Tyr 
5 385 

Glu Tyr Ala 

Thr Asp Leu 

10 lie Gly Ser 
435 

Ala Ser Gly 

450 
lie Lys Lys 
15 465 

Asn Arg Ala 

Gin lie Thr 

20 lie Gly Tyr 
515 

Gly Ser Met 

530 
Pro Lys Ala 
25 545 

Ala Ser Val 

Glu Ala Gly 

30 Glu Leu Gly 
595 

Leu Tyr Tyr 

610 
Ser Asp Val 
35 625 

Met Gly Leu 



Ser Pro Ser Val 
375 

Gly Tyr Thr Asp 
390 

Leu Gin Pro Tyr 
405 

lie Arg Gin Lys 
420 

Leu Asn Trp His 

Asn lie Tyr Lys 
455 

Tyr Asn Gin Pro 
470 

Asp Lys Arg Glu 
485 

Pro Glu Leu Asn 
500 

Thr lie Asn Gly 

Gin His lie Asp 
535 

Gly Leu Thr Tyr 
550 

Ala Val Ala His 
565 

lie Gly Gin Tyr 
580 

Tyr Arg Tyr Ala 

Met Gin Tyr Lys 
615 

Gly Gin Met Leu 
630 

Glu Leu Thr Leu 



360 

lie Leu Asn Leu Thr 
380 

Glu Tyr Arg Thr Gly 
395 

Val Glu Asn Ser Val 
410 

Tyr Leu Asp Asn Asp 
425 

Thr Gly Ala Trp Asp 
440 

Gly Asp His Phe Gly 
460 

Leu Ala Pro Asp Phe 
475 

Gly Ala Ala Phe Ala 
490 

Met Tyr Ala Asp Leu 
505 

lie Thr Asp Glu Tyr 
520 

Leu Asp Lys Thr Phe 
540 

Ser Phe Asp Asp Ala 
555 

Arg Glu Pro Asn Arg 
570 

Pro Thr Pro Glu Arg 
585 

Ser Pro Leu Leu Ser 
600 

Asp Gin Leu Val Leu 
620 

Thr Ser Asn Val Pro 
635 

Gly Trp Gin lie Leu 



365 

Ala His Tyr Thr 

Arg Lys Leu Lys 
400 

Thr Val Lys Lys 
415 

Phe Gly Gly Leu 
430 

Leu Gin Phe Gly 
445 

Arg lie Thr Tyr 

Glu Tyr Tyr Arg 

480 

Lys Ala Asn Trp 
495 

Gin Tyr Arg Thr 
510 

Asp Glu Val Gin 
525 

Arg Phe Leu Asn 

His Thr Ala Tyr 
560 

Thr Asn Tyr Thr 
575 

Leu lie Asp Tyr 
590 

Ala Gly Val Gly 
605 

Asp Gly Arg Leu 

Asp Ser Tyr Arg 
640 

Pro Arg Leu Leu 
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645 650 655 

Arg Trp Asp Ala Ser Phe Thr Met Ser Arg Asn Lys lie Asp Arg Tyr 

660 665 670 

Val Gin Tyr Thr Ser Val Tyr Asp Ala Asp Tyr Asn Trp Leu Glu Leu 
5 675 680 685 

Lys Glu Glu Thr Leu Glu Ser Thr Asp lie Ala Tyr Ser Pro Asn Val 

690 695 700 

lie Ala Gly Ser Met Leu Thr Leu Ser His Ala Gly Phe Glu Met Ala 
705 710 715 720 

10 Trp Thr Ser Arg Phe Val Ser Lys Gin Tyr Leu Asp Asn Thr Gin Arg 

725 730 735 

Ser Asp Arg Met Leu Ser Ser Tyr Trp Val Asn Asp Leu Arg Leu Gly 

740 745 750 

Tyr Val Leu Pro Val His Phe Val Lys Arg Val Ala Leu Gly Val Gin 
15 755 760 765 

Leu Asn Asn Leu Phe Asn Leu Met Tyr Ala Ser Asn Ala Tyr lie Tyr 

770 775 780 

Asp Ala Gly Tyr Val Gin Ala Ser Gly Glu Leu Ser Ala Tyr Ala Asp 
785 790 795 800 

20 Leu Arg Tyr Tyr Pro Gin Ala Gly Phe Asn Ala Leu Gly Unk Leu Thr 

805 810 815 

lie Asp Phe 



25 (2) INFORMATION FOR SEQ ID NO: 52 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 319 amino acids 

(B) TYPE: amino acid 
30 (D) TOPOLOGY: linear 



35 



(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

5 (B) LOCATION 1...319 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52 

Phe Thr Ser Gly Thr lie Phe Val Arg lie Ala lie Ser Gly Arg Val 
10 1 5 10 15 

Val Cys Arg Glu Val Glu Asp Asn Gly Arg Arg Glu Ala Val Arg His 

20 25 30 

Asp Gly Met Val Val Thr Leu Leu Val lie Val Gly lie Val Val Val 
35 40 45 

15 Val Arg Tyr Ser Leu Arg Val His Val His Lys Thr Gly Thr Val Val 

50 55 60 

Ser Ala Ala lie Phe Gly Phe lie Leu Leu Gly Lys Thr Val Pro Cys 
65 70 75 80 

Asp Thr Arg Asn Phe Phe Ser Ser Glu Ser Asp Glu Pro Glu Ser Arg 
20 85 90 95 

Val Ala Thr Glu lie Ala His Leu Cys Glu lie Gly Phe Gin lie His 

100 105 110 

Ala Ser Ser lie His Val Ala Val Arg Thr Asp Phe Gly Gin Ala Gly 
115 120 12.5 

25 lie His Cys Pro Met Ala Thr Asp Ala Ser Ala Thr Glu Phe Asp Arg 

130 135 140 

Ser Ala Glu Cys Ala Glu Arg Thr Ser Ala Gin lie Asp Thr Ala lie 
145 150 155 160 

Arg Ser Gin Ser Gin lie lie Arg Thr His lie Asp Thr Cys Pro Lys 
30 165 170 175 

Ser Ser Gly Thr lie Gly Gly Ser Thr His Thr Ser Leu His Leu Lys 

180 185 190 

Val Phe Asp Gly Arg Gly Glu Val Gly His lie His Pro Lys Asp Gly 
195 200 205 

35 Leu Arg Phe Gly Val Val Glu Gly Tyr Ser He Gly Ser Tyr Val Asp 

210 215 220 
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Ala lie Gly lie Gly Ala Thr His Ala Lys Ala Gly lie Ser Asp Th 
225 230 235 24 

Arg Thr Gly lie Ala Gly Gly Tyr His Gly Gly Ser Gin His Gin Gl] 

245 250 255 

He Gly Asp Val Thr Thr He He Gly Leu Gly Lys Phe Gly Leu Al; 

260 265 270 

Asn Val Gly Val Ser Asp Arg Gly Phe Arg Arg Gly Ala Ser Ser Ty: 

275 280 285 

Asp Leu Tyr Gly Leu Glu Leu His He Ala Lys Thr He Tyr Leu Al; 

290 295 300 

Val Gly Asp Gly Cys Leu Cys Arg Gin Ala Glu Arg Gin Asp Gly 
305 310 315 

(2) INFORMATION FOR SEQ ID NO: 53 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 243 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii ) HYPOTHETICAL : YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: mis c_f eature 

(B) LOCATION 1. . .243 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 



Ser Thr He Met Lys Lys Ala He Leu Ser Gly Ala Ala Leu Leu Leu 

1 5 10 15 

Gly Leu Cys Ala Asn Ala Gin Asn Val Gin Leu His Tyr Asp Phe Gly 
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20 

His Ser lie Tyr 
35 

Val Glu Asn Phe 
50 

Asp Met Asp Tyr 
65 

Ser Arg Glu Leu 

Tyr Asn Gly Gly 
100 

lie Gly Ala Thr 
115 

Thr lie Thr Pro 
130 

Tyr Gin lie Thr 
145 

Thr Phe Asn Gly 

He Gly Gly Pro 
180 

Glu Pro Gin Phe 
195 

Asp Phe Asn Leu 
210 

Ala Arg Met Asp 
225 

Thr Phe Asn 



Asp Glu Leu Asp 
40 

Thr Pro Asp Lys 
55 

Thr Gly Lys Gly 
70 

Lys Phe Trp Gin 
85 

Leu Ser Thr Ser 

Tyr Thr Tyr Asn 
120 

Met Tyr Lys His 
135 

Gly Thr Trp Tyr 
150 

Phe Leu Asp Leu 
165 

Val Leu Lys Glu 

Trp He Asn Leu 
200 

Ser He Gly Thr 
215 

Lys Phe Ser Cys 
230 



25 

Gly Arg Pro Lys 

Trp Gly Ser Thr 
60 

He Gin Ser Ala 
75 

Ala Pro Val Ser 
90 

Phe Thr Phe Gly 
105 

Asn Pro Ser Phe 

Leu Gly Ala His 
140 

Met His Phe Leu 
155 

Trp Gly Phe Pro 
170 

Gly Asp Lys Phe 
185 

Asn Arg He Lys 

Glu Met Glu He 
220 

He Pro Thr Leu 
235 



30 

Leu Thr Thr Thr 
45 

Phe Phe Phe He 

Tyr Trp Glu He 
80 

He His Leu Glu 
95 

His Asp Ala Leu 
110 

Thr Arg Gly Phe 
125 

Asp Phe His Thr 

Asp Gly Leu Leu 
160 

Gin Glu Asn Pro 
175 

Val Phe Leu Ser 
190 

Gly He Asp Lys 
205 

Ser Axg Asn Phe 

Ala Val Lys Trp 
240 



(2) INFORMATION FOR SEQ ID NO: 54 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 450 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1. . .450 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 

Trp Cys Gin Ser Asp Pro lie Pro His Cys His Ala Val Tyr Arg Asn 

1 5 10 15 

Ala Gin Gly His Gin Gin Gly Arg Ser Asp Gin Asp Arg Leu Pro Pro 

20 25 30 

Val Arg Gly Tyr Arg Tyr Ala Arg Cys Leu Gly Arg Tyr Glu Gly Phe 

35 40 45 

Gly Glu Arg Met Arg Leu Ser Ala lie Leu lie Ala Leu lie Val Met 

50 55 60 

Leu Pro Ala Val Leu Ser Gly Gin His Tyr Tyr Ser Met Ala Gly Glu 
65 70 75 80 

Arg Leu Glu Thr Asp Ser lie Arg Pro Asn Glu Leu Ser Ala Ser lie 

85 90 95 

Arg Ser Ala Leu Phe Phe Arg Asn Asn Glu Tyr Asn Ala Arg Ser Val 

100 105 HO 

Lys Gly Tyr Thr Leu Pro Gly Ala Arg Val Ser Ala Phe Ala Ser Tyr 

115 120 125 

Ser Leu Pro Ala Ala His Gly Val Lys Leu Ser Leu Gly Val Ser Thr 

130 135 140 

Leu Asn Tyr Trp Gly Ala Ser Arg Tyr Pro Ala Gly lie Ala Tyr Ser 
i45 150 155 160 

Asp Leu Pro Tyr Trp Thr Asp Tyr Asn Asp Tyr Val Arg Leu Arg lie 
165 170 175 



104 



Leu 

Met 

5 lie 

Gin 
225 
Trp 

10 

Phe 
Arg 
15 Tyr 

Val 

305 
He 

20 

Phe 

Leu 

25 Val 

Gin 
385 
Tyr 

30 

Val 
Leu 
35 Asn 



Pro Tyr 

Leu Gly 
195 
Tyr Asn 
210 

Phe Arg 

Met Ser 

Gly Leu 

Leu Glu 
275 
Asn Trp 
290 

Gly Leu 

Trp Gly 

Pro Tyr 

Glu His 
355 
Ser Pro 
370 

Pro Leu 

Ser Trp 

Trp Phe 

Thr Met 
435 

His 
450 



Val Gin 
180 

As n He 

Pro Glu 

Gly Asp 

Met He 
245 
Ser Thr 
260 

Leu Pro 

Ala Gin 

Lys Leu 

Ser Ala 
325 
Glu Arg 
340 

Phe Ala 

Phe Ala 

Thr Asn 

Arg Met 
405 
Gin Pro 
420 

Arg He 



Ala Met 

Ala Gly 

Leu Asp 
215 
Trp Thr 
230 

Phe Lys 

Thr Ser 

Leu Gin 

Gin Asp 
295 
Ser Tyr 
310 

Tyr Gly 

Gly Trp 

Phe Arg 

Ala Pro 
375 
Gly Trp 
390 

Ala Arg 
Ser Asp 
Asp Pro 



Leu Lys 
185 
Gly Thr 
200 

Leu Thr 

Arg Phe 

Asn Asp 

Lys Leu 
265 
Ala He 
280 

Thr Val 

Arg Pro 

Val Ala 

Gly Gly 
345 
Thr Asp 
360 

Phe Ala 

Gly Asp 

Ser Val 

Arg Phe 
425 
Lys Phe 
440 



Pro Thr 

Ala His 

Ala Asp 

Arg Met 
235 
Asn His 
250 

Leu Ser 
Ala Thr 
His Thr 

Arg Thr 

315 
Ala Leu 
330 

Tyr Leu 

Tyr Trp 

Asn Ser 

Tyr He 
395 
Ser Leu 
410 

Ala Met 
Pro He 



Ala Thr 

Gly Leu 
205 
Pro Glu 
220 

Asp Val 

Gin Glu 

Gly Glu 

His Arg 
285 
Trp Val 

300 

Asp Lys 

Ser Ser 

Ser Leu 

Tyr Gly 
365 
Leu Thr 
380 

Arg Leu 

Ala Ala 

Ser His 

Ala Phe 
445 



Thr Ala Leu 
190 

He Glu Pro 

Ala Gly Val 

Trp Val Asn 
240 

Ser Phe Val 

255 
Gly Lys Trp 
270 

Gly Gly Glu 
Asn Gly Ala 

Pro Met Gin 

320 

Gly Gly Tyr 

335 
Gly Met Asp 
350 

Arg His Tyr 

Tyr Asp Lys 

Tyr Ala Asp 
400 

Val Ala Arg 
415 

Ala Leu Glu 
430 

Leu Lys Gly 
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10 



15 



(2) INFORMATION FOR SEQ ID NO: 55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 47 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : miscjeature 

(B) LOCATION 1 . . .4 47 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55 

Pro Phe Ser Pro Pro Trp Val Cys Ala Gly Ala Ala Ser Ser Val Arg 

15 10 15 

Thr Arg Pro Cys Ser Thr Val Ser Ser Thr Ala Ala Ser Ser His Arg 

25 20 25 30 

Arg Ser Glu His Met Arg Ser Ala Asp Ser Ser Pro Ala Tyr Ser Pro 

35 40 45 

lie Gly Ser Leu Ser Ser Ser His Ser Phe Asp Ser lie Pro Asp Gin 

50 55 60 

30 Arg Met Asn Lys Ser Leu Leu Ser Leu Ala Cys Leu lie Leu Cys Gly 
65 70 75 80 

Met Pro Ala lie Ala Gin Gin Thr Gly Pro Ala Glu Arg Ser Gly Glu 

85 90 95 

Pro Ser Leu Ala Glu Arg Val Phe Gly Leu Glu Gin Lys Gin Lys Lys 

35 100 105 110 

Leu Lys Val Tyr Leu Gly lie Gin Ser Phe Tyr Asp Gin Pro Leu Val 
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Asp 

Ser 
5 145 
Arg 

Asn 

10 Ser 

Asp 

Met 
15 225 
Glu 

Ala 

20 Pro 

Leu 

Asn 
25 305 
He 

Lys 

30 val 

Gin 

Ala 
35 385 
Gly 



115 
Asp Glu 
130 

Ala His 

Leu Asn 

Ala He 

Phe Phe 
195 
Met Asn 
210 

Thr Cys 

Gin Gin 

Asp Arg 

Leu Leu 
275 
His Met 
290 

Met Trp 

Asn Gly 

Gly He 

Lys Tyr 
355 
Val Asn 
370 

Gin Tyr 
Gly Val 



Ser His 

Gly Glu 

Arg Ala 
165 
Asp He 
180 

Phe Gly 

Pro Val 

Tyr Thr 

Leu Gin 
245 
Tyr His 
260 

Tyr Ser 

Arg Tyr 

Tyr Phe 

Tyr Leu 
325 
Met Thr 
340 

Tyr Ala 
Leu Phe 
Gly Glu 
Glu Tyr 



He Gly 
135 
Leu Asn 
150 

Ala Asp 

Ala Gly 

Arg Gin 

Glu He 
215 

Ser Gly 
230 

Leu Gin 

Val Thr 

Ala Gin 

Ala Val 
295 
Thr Ala 
310 

Asp Leu 

Ala Arg 

Leu Val 

Leu Lys 
375 
Ser Ser 
390 

Tyr Pro 



120 

His Phe 

Arg His 

Gly Thr 

Val Asp 
185 
Tyr Ala 
200 

Tyr Gin 

Val Asn 

Val Leu 

Pro Asp 
265 
Trp Asn 
280 

Ser Met 

Gly Asn 

Thr Tyr 

Tyr Gly 
345 
Ser Lys 
360 

Gly Met 
His Thr 
Thr Glu 



Lys Val 

Leu Ser 
155 
Ser Phe 
170 

Trp His 

Arg Phe 

Tyr Ser 

Phe Ala 
235 
Asn Ala 
250 

Val Ala 

Gly Thr 

Ala His 

Leu Phe 
315 
Ser He 
330 

Lys Gly 

Trp Asn 

Tyr Glu 

Arg His 
395 
Thr Asn 



125 
Gin Glu 
140 

Phe Asp 

Ala Asp 

Pro Asn 

Gly Gly 
205 
Asp Leu 
220 

Trp Asn 

Tyr Asn 

Thr Ala 

Leu Leu 
285 
Gin Ala 
300 

Asn Pro 

Glu Gly 

Lys Thr 

Phe Arg 
365 
Asn Gly 
380 

Ser Tyr 
Phe Arg 



Leu Arg Met 

Trp Arg Gin 
160 

Asn Leu Ser 
175 

Asp Lys Val 
190 

He Glu Tyr 

Val Asp Tyr 

Phe His Pro 
240 

Asn Arg Phe 

255 
Thr Ser Tyr 
270 

Gly Gly Ala 

Gin Glu Arg 

Gly Lys Arg 
320 

Leu Asp Asp 
335 

Leu Thr Asp 
350 

He Phe Asp 

Tyr Ala Pro 

Gly Tyr Met 
400 

Leu Phe Val 



107 



405 

Thr Tyr lie Gly Arg His Tyr Arg 
420 

Thr Asn Ala Leu Arg Ala Gly Leu 
435 440 



410 415 
Tyr Ser Ala Thr Glu Thr Glu Ser 
425 430 
He Tyr Gin He Pro Phe Leu 
445 



(2) INFORMATION FOR SEQ ID NO: 56 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 532 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1. . .532 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56 



Tyr Arg Arg Ser Gly Arg Val Cys 
1 5 

Ser Tyr Ala Asp Thr Arg Leu Gin 
20 

Ala Gly Thr Arg Thr Lys Met Lys 

35 40 
Leu Tyr He Leu Ser Ser lie Ser 

50 55 
Val Gin Gly He Glu Leu Asp Thr 
65 70 



Pro Arg He Leu Arg Asn Lys Arg 

10 15 
Pro Arg Arg Ala Val Arg Leu Arg 
25 30 
Arg Arg Phe Leu Ser Leu Leu Leu 
45 

Leu Ser Ala Gin Arg Phe Pro Met 
60 

Asp Ser Leu Phe Ser Leu Pro Lys 
75 80 
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Arg Pro Trp Arg Ala lie Gly Lys Thr lie Gly Val Asn Leu Ala Val 

85 90 95 

Trp Gly Phe Asp His Phe lie Met Asn Glu Asp Phe Ala Asp lie Ser 
100 105 110 

5 Trp Gin Thr lie Lys Ser Asn Phe Gin Thr Gly Phe Gly Trp Asp Asn 
115 120 125 

Asp Lys Phe Val Thr Asn Leu Phe Ala His Pro Tyr His Gly Ser Leu 

130 135 140 

Tyr Phe Asn Ala Ala Arg Ser Asn Gly Leu Ser Phe Arg His Ser Ala 
10 145 150 155 160 

Pro Phe Ala Phe Phe Gly Ser Leu Met Trp Glu Leu Leu Met Glu Asn 

165 170 175 

Glu Pro Pro Ser He Asn Asp Leu Cys Ala Thr Thr He Gly Gly He 
180 185 190 

15 Ala Leu Gly Glu Met Gly His Arg Leu Ser Asp Leu Leu He Asp Asn 
195 200 205 

Arg Thr Thr Gly Trp Glu Arg Met Gly Arg Glu Val Ala He Ala Leu 

210 215 220 

He Asn Pro Met Arg Phe Leu Asn Arg Leu Thr Ala Gly Glu Val Thr 
20 225 230 235 240 

Ser Val Gly Ser Arg Ser Gly Gin He Phe Gin Ser Val Pro He Asn 

245 250 255 

He Val Val Asp Ala Gly Phe Arg Phe Leu Ala Asp Lys Arg His Ala 
260 265 270 

25 Arg Thr Gly Ala Thr Ala Leu Thr Leu Asn Leu Arg Phe Asp Tyr Gly 
275 280 285 

Asp Pro Phe Arg Ser Glu Thr Phe Ser Pro Tyr Asp Phe Phe Gin Phe 

290 295 300 

Lys Ala Gly Leu Ser Phe Ser Glu Ser Gin Pro Leu Leu Ser Gin He 
30 305 310 315 320 

Asn Leu He Gly He Leu Ser Gly Cys Gin Leu Leu Ala His Glu Arg 

325 330 335 

Thr Val Leu Val Gly Gly Leu Phe Gin His Phe Asp Tyr Tyr Asn Ser 
340 345 350 

35 Glu Lys Arg He Ser Lys Asn Ser Glu Glu Val Leu Val Thr Pro Tyr 
355 360 365 



109 



Arg lie Ser Gin Val Ala Ala Leu Gly Gly Gly Leu lie Phe Gin His 

370 375 380 

His Gly Lys Phe Arg Arg Arg Pro Leu Glu Leu Tyr Ala Glu Thr Tyr 
385 390 395 400 

Leu Asn Val Val Pro Met Gly Ala Ser Leu Ser Asp His Tyr Asn Val 

405 410 415 

Asp Asn Arg Asp Tyr Asn Leu Gly Ser Gly Leu Ser Gly Lys Leu Tyr 

420 425 430 

Leu Gly Ala Thr Tyr Asn Asp Leu Trp Ser Trp Leu Leu Gly Val Glu 

435 440 445 

Ser Tyr Arg Leu Tyr Thr Trp lie Gly Tyr Glu Glu Pro His Gin Lys 

450 455 460 

Asn Thr Asp Val Ser Ser Phe Met Val Gin Gly Asp Glu Ser Lys Ala 
465 470 475 480 

Arg Leu Leu Val Thr Ser Ser Glu Phe Ala Phe His Pro Gly Pro Trp 

485 490 495 

His Val Ala lie Val Ala Arg Arg Phe lie Arg Lys Thr Ala Tyr Gin 

500 505 510 

Phe Tyr Pro Asn Val Ser Phe Asp Thr Gly Asp lie Gin Leu Arg Val 

515 520 525 

Gly Phe His Phe 
530 

(2) INFORMATION FOR SEQ ID NO: 57 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 
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(ix) FEATURE : 

(A) NAME/ KEY : mis c_f eature 

(B) LOCATION 1...300 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57 

Lys Arg Arg Lys Lys Gin Met Lys Arg Leu lie Val Phe Leu Ala Met 
15 10 15 

10 Gly Gly Leu Leu Phe Thr Leu Ala Asn Ala Gin Glu Ala Asn Thr Ala 
20 25 30 

Ser Asp Thr Pro Lys Lys Asp Trp Thr lie Lys Gly Val Thr Gly Leu 

35 40 45 

Asn Ala Ser Gin Thr Ser Leu Thr Asn Trp Ala Ala Gly Gly Glu Asn 
15 50 55 60 

Thr Val Ala Gly Asn Leu Tyr Leu Asn lie Asp Ala Asn Tyr Leu Lys 
65 70 75 80 

Asp Lys Trp Ser Trp Asp Asn Gly Leu Arg Thr Asp Phe Gly Leu Thr 
85 90 95 

20 Tyr Thr Thr Ala Asn Lys Trp Asn Lys Ser Val Asp Lys lie Glu Leu 
100 105 110 

Phe Thr Lys Ala Gly Tyr Glu lie Gly Lys His Trp Tyr Gly Ser Ala 

115 120 125 

Leu Phe Thr Phe Leu Ser Gin Tyr Ala Lys Gly Tyr Glu Lys Pro Ser 
25 130 135 140 

Asp His Leu Thr Gly Val Lys His lie Ser Asn Phe Phe Ala Pro Ala 
145 150 155 160 

Tyr Leu Thr Leu Gly lie Gly Ala Asp Tyr Lys Pro Asn Glu Lys Phe 
165 170 175 

30 Ser Leu Tyr Leu Ser Pro Thr Thr Gly Lys Leu Thr Val Val Ala Asp 
180 185 190 

Asp Tyr Leu Ser Ser Leu Gly Ala Phe Gly Val Lys Val Gly Glu Lys 

195 200 205 

Thr Met Phe Glu Leu Gly Ala Leu Val Val Gly Ser Ala Asn lie Asn 
35 210 215 220 

Leu Met Glu Asn Val Asn Leu lie Thr Lys Ala Ser Phe Phe Ser Ala 



Ill 



225 

Tyr Thr His Asp Phe 
245 

Ala Met Lys lie Asn 
260 

lie Tyr Asp Asp Asp 
275 

Lys Glu Val Val Gly 
290 



230 235 
Gly Asn lie Asp lie Asn 
250 

Lys Phe Leu Thr Ala Thr 
265 

Val Lys lie Asn Asp Gly 
280 

Val Gly Val Ala Tyr Thr 
295 



240 

Trp Glu Ala Met Leu 
255 

lie Ala Thr Asn Leu 
270 

Pro Lys lie Gin Phe 
285 

Phe 
300 



(2) INFORMATION FOR SEQ ID NO: 58 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 973 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1. . . 973 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58 



Phe Tyr Thr Gly lie Asp Leu His 
1 5 

Phe Pro Leu Leu Leu Leu lie Leu 
20 

Lys Glu Lys His Ser Val Thr Glu 



lie Glu Ser Gin Met Lys Lys Leu 

10 15 
Ser lie Leu Val Gly Cys Gly Lys 
25 30 
lie Ala Unk Glu Lys Lys Arg lie 
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Thr Ala Leu Leu Tyr Glu Lys Glu Leu Pro Thr Asp Ser Val Lys Gin 

50 55 60 

Leu Tyr Glu Asn Ser Val Gin Asn Lys Asn Leu Val Gly Gin Met Leu 
65 70 75 80 

5 Phe Ala lie Glu Val Gly Lys Arg Met Arg Asn Met Ser Gin Tyr Thr 

85 90 95 

Asp Ala Met Leu Tyr His Gin Lys Gly Leu Asn Ala Ala Leu Asn Leu 

100 105 110 

Arg Asp Thr lie Val Ala Ala Gin Ala Trp Asn His Leu Gly Thr Asp 
10 115 120 125 

Ser Arg Arg lie Gly Ala Leu Ala Glu Ala Ser Asp Tyr His Tyr Lys 

130 135 140 

Ala Leu Ser Leu lie Glu Ser Phe Ser Gly Asn Gin Asn Arg Pro Ala 
145 150 155 160. 

15 lie Lys Ala Arg Ser Ala Ala Leu Asn Gly lie Gly Asn lie Asn Leu 

165 170 175 

Glu Leu Gly Tyr His Asp Glu Ala Glu Lys Asn Phe Leu Lys Ala Leu 

180 185 190 

Gin Gly Glu Lys Glu Leu Asp Ser Pro Leu Gly Gin Ala lie Asn Tyr 
20 195 200 205 

Ala Asn Leu Gly Arg lie Tyr Arg Gin Arg Lys Glu Tyr Asp Lys Ala 

210 215 220 

Arg Thr Tyr Phe Leu Leu Ser Leu Glu Gin Asn Asn Met /Via Glu Asn 
225 230 235 - 240 

25 Leu Met Gly lie Gly Leu Cys Ser lie Asn Leu Gly Glu Val Asp Glu 

245 250 255 

Glu Lys Gly Asp Tyr Gin Lys Ala Leu Gin Glu Tyr Ala Thr Ala Tyr 

260 265 270 

Lys Leu Met Glu Gin Leu Ser Asp Arg Trp His Trp Leu Asn Ser Cys 
30 275 280 285 

lie Pro Met Ala Arg lie Asn Leu Lys Gin Gly Asn Glu Arg Leu Tyr 

290 295 300 

Gin His Phe He Ser Leu Ala Glu Gly Thr Ala Lys Glu He Asn Ser 
305 310 315 320 

35 Thr Ser His Leu He Glu He Tyr Asn Leu Gin Tyr Glu Asn Leu Glu 

325 330 335 



113 



Arg Lys Lys Glu Tyr Lys Gin Ala Leu Glu Ala Phe Cys Leu Ser Lys 

340 345 350 

Thr Leu Ser Asp Ser Met Ser lie Ala His Lys Val Ser Ser lie Gin 

355 360 365 

Glu Thr Arg Phe Asn Tyr Glu Arg Asn Lys Ser Gin Lys Glu Leu Glu 

370 375 380 

Glu lie Gin Gin Val Ser Lys Ala Lys Gin Glu Lys Ser Lys Phe lie 
385 390 395 400 

Leu Leu Ser Thr Leu Phe Ala Leu Phe lie Ser lie Leu Leu lie Ser 

405 410 415 

Val Leu Thr Tyr Ala Tyr Arg Gin Gly Lys Lys His Asn Lys Leu lie 

420 425 430 

Lys Glu Thr Asp Lys Leu Arg Ser Gly Phe Phe Thr Gly lie Thr His 

435 440 445 

Glu Phe Arg Thr Pro lie Thr Val lie Gin Gly Leu Asn Glu Lys Met 

450 455 460 

Ser Ser Ser Pro Asp Leu Gin Ala Ser Asp Arg Thr Glu Leu His Lys 
465 470 475 480 

lie lie Asp Arg Gin Ser Ser His Met Leu Asn Leu Val Asn Gin Leu 

485 490 495 

Leu Asp lie Cys Lys lie Arg Ser Gly Val Ser Thr Pro Glu Trp Arg 

500 505 510 

Asn Gly Asp lie Val Ser Phe Val Gin lie Leu lie Asp Ser Phe Ala 

515 520 525 ' 

Pro Tyr Ala Gin Ala Gin Asp lie Thr Leu Glu Leu Gin Pro Glu Ser 

530 535 540 

Lys Pro lie Val Val Asp Phe Val Pro Ser Tyr Leu Gin Lys lie lie 
545 550 555 560 

Ser Asn Leu Leu Ser Asn Ala lie Lys Tyr Ser Leu Ala Gly Gly Arg 

565 570 575 

Val Val lie Ser Leu Ala Lys Thr Lys Asn Glu Lys Asn Leu lie lie 

580 585 590 

Arg Val Ala Asp Asn Gly lie Gly lie Asp Lys Thr Asp Gin Ala His 

595 600 605 

lie Phe Asp lie Phe Tyr Arg Gly Gin Ser Ala Thr Glu Lys His Gly 
610 615 620 
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Ser Gly Val Gly Leu Ser Phe Thr Asn lie Leu Val Glu Asn Leu Arg 
625 630 635 640 

Gly Thr lie Lys Val Glu Ser Gin Pro Gly Lys Gly Ser Ala Phe Thr 
645 650 655 

5 lie Ser lie Pro Thr Gin Asn Gin Ser Ser Ser Ala Glu lie Leu Pro 
660 665 670 

Trp Leu Pro Ser Ser Asp Asp lie Val Met Pro Val His lie Ala Pro 

675 680 685 

Asp Asp Ser Pro Thr Ser Pro Met Val Ala Ala Leu Asn His Arg Phe 
10 690 695 700 

Glu Asp Glu Arg Pro Thr lie Leu Leu Val Glu Asp Asn Lys Asp lie 
705 710 715 720 

Asn Leu Leu Val Lys Leu Leu Leu Cys Asp Arg Tyr Asn Val Leu Ser 
725 730 735 

15 Ala Ala Asn Gly Lys Glu Gly lie Ala Leu Ala Thr Glu His lie Pro 
740 745 750 

Asp lie lie He Thr Asp He Met Met Pro He Met Asp Gly He Glu 

755 760 765 

Met Thr lie Arg Met Lys Gin Ser Pro Leu Leu Cys His He Pro He 
20 770 775 780 

Val Ala Leu Thr Ala Lys Ser Thr Glu Gin Asp Arg Leu Glu Gly He 
785 790 795 800 

Lys Ser Gly Val Val Ser Tyr Leu Cys Lys Pro Phe Ser Pro Glu Glu 
805 810 - 815 

25 Leu Leu Met Arg He Glu Gin Leu Leu Lys Asp Arg Glu Leu Leu Lys 
820 825 830 

Lys Phe Tyr Met Gin Lys Leu Met Leu Asp Arg Lys Pro Glu Glu Glu 

835 840 845 

Pro Gin Pro He Asp Asp Ser Ser Met Gin Phe Leu Leu Ala Ala Lys 
30 850 855 860 

Asp Ala Val Ser Gly Gly He Lys Gin Asn Pro Asp Phe Ser Ala Gin 
865 870 875 880 

Asp Leu Ala Glu Lys Met Cys Met Ser Pro Ser Gin Leu Asn Arg Lys 
885 890 895 

35 Leu Thr Ser Val Val Gly Cys Ser Thr He Gly Tyr He Gin Gin He 
900 905 910 
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Lys lie Lys Leu Ala 
915 

Ser Asp lie Ser lie 
930 

Arg Thr Phe Lys Arg 
945 

Lys Leu Leu Ala Met 
965 



Cys Lys Leu Leu Ala 
920 

Glu Ala Gly Phe Ser 
935 

Tyr Met Asn Cys Ser 
950 

Pro Gly Ser Asp Lys 
970 



Asp Glu Ser Lys Asn II 
925 

Asp Pro Ala Tyr Phe Se 
940 

Pro Ser Gin Tyr Arg Gl 
955 96 
Glu Thr Val 



(2) INFORMATION FOR SEQ ID NO: 59 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 227 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1...227 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59 



Tyr Lys Gin Ala 
1 

Leu Cys Ala Leu 
20 

Asn Thr Glu Lys 
35 

Tyr He Ala Gly 



He Met Lys His 
5 

Ser Phe Ser Gly 

Ser Arg Phe Asp 
40 

Ser Thr Thr Asn 



Leu Phe Lys Ser 
10 

Thr Tyr Thr Phe 
25 

Phe Ser Val Arg 
Leu Met Tyr Gly 



Thr Leu Val Leu 
15 

Ala Gin Glu Asn 
30 

Leu Gly Gin Gly 
45 

Tyr Thr Ser Ala 
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50 55 60 

Asn Asp Arg Leu Leu Ser Gly Ala lie Tyr Leu Gly Leu Thr Pro Ser 
65 70 75 80 

Lys Lys Glu Asn Ala Thr Gly Val Ala Phe Arg Phe Leu Ser Pro Ser 
5 85 90 95 

Pro Gly Tyr Tyr Val Asp lie Ser Gly Lys Glu Asn Thr Leu Asn Tyr 

100 105 110 

Ala Phe Tyr Val Val Gly Ala Tyr Asn Arg lie Ala lie Pro lie Arg 
115 120 125 

10 Pro lie Lys Asn Phe Asn Phe lie Phe Ser Thr Glu Val Gly Met Ala 
130 135 140 

Trp Met Ser Arg His Glu Gin lie Tyr Asn Ser Thr Ser Gin Thr Trp 
145 150 155 160 

Asp Lys Gin Arg Lys Ser Arg Ser Gly Leu Asp Phe Gly Leu Gly Met 
15 165 170 175 

His Leu Gin Unk His lie Asn Lys Thr Val Tyr Phe Met Ala Gly Thr 

180 185 190 

Asp Leu Thr Ser Cys Met Phe Gly Lys Arg lie Asn Asp Tyr Gin Gin 
195 200 205 

20 Lys Asp Arg Thr Phe lie Ala Leu lie Asp Asn Ser lie Gly lie Gly 
210 215 220 

Leu Asn Leu 
225 

25 (2) INFORMATION FOR SEQ ID NO: 60 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 235 amino acids 

(B) TYPE: amino acid 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



35 



(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : mis c_f eature 
5 (B) LOCATION 1...235 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60 

Met Lys Thr lie Ser Lys Asn His Ala Ala Arg lie Cys Ala Ala lie 
10 1 5 10 15 

Ala Leu Phe Ala Val Cys Asn Gly Arg lie Ala Ala Gin Asp Phe Leu 

20 25 30 

Tyr Glu lie Gly Gly Gly Phe Gly Ala Ala Gin Tyr Phe Gly Asp Ala 
35 40 45 

15 Asn Arg Gly Leu Phe Gly Ser Ser Gly Val Gly Leu Glu Leu Val Gly 
50 55 60 

Arg Tyr Asn Tyr Asn Phe Arg Trp Ala Phe Ser Thr Met Leu Asp Trp 
65 70 75 80 

Arg Thr Leu Arg Gly Asp Thr Asp Lys Ser Gly Asn Val Phe Pro Asp 
20 85 90 95 

Phe Ala Gin Ala Asp Phe Lys Val Gly Leu Thr Gin Leu His Val Arg 

100 105 110 

Ser Glu Phe Asn Phe Leu Pro Tyr Ser Asp Gly Tyr Lys Tyr Leu Gly 
115 120 125 ' 

25 Thr Ala Arg Leu Ser Pro Tyr Val Ala Ala Gly Leu Ser Leu Gly Phe 
130 135 140 

Ala Ser Gly Ala Lys Gly Ser Ala Phe Ala Pro Gly lie Thr Ala Gly 
145 150 155 160 

Met Gly Val Lys Tyr Lys Leu Lys Pro Arg lie Asn Val Gly lie Glu 
30 165 170 175 

Tyr Ser Phe Thr Gly Leu Leu Thr Asp Ala Leu Asp Ala Leu Thr Asp 

180 185 190 

Lys Ser Val Trp Leu Glu Asp Pro Tyr Lys lie Asn Asp Ser Trp Val 
195 200 205 

35 Lys Asn Lys Asp Ala Thr Gly Ala Leu Val Leu Arg lie Thr Tyr Asp 
210 215 220 
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5 



10 



Phe Gly Leu Arg Lys Thr Phe Cys Asn Lys Gin 
225 230 235 

(2) INFORMATION FOR SEQ ID NO: 61 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1162 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: YES 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: mis c_f eature 
20 (B) LOCATION 1...1162 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61 

Ala lie Ser Gin Met Lys Arg lie Leu Pro lie Val Ala Phe Leu Ser 
25 1 5 10 15 

Leu Phe Leu Ala Leu Ala Leu Pro Ala Lys Ala Gin Arg Ala Met Gly 

20 25 30 

Lys Thr Ala Asp Arg Ser Leu Met Ala Ser Gly His Trp Val Lys lie 
35 40 45 

30 Arg Val Asp Ala Ser Gly Val Tyr Arg Leu Thr Asp Glu Gin Leu Arg 
50 55 60 

Ala Asn Gly Phe Ser Asp Pro Ser Lys Val Gly Val Phe Gly Tyr Gly 
65 70 75 80 

Gly Gly Val Leu Pro Glu Asp Leu Ser Arg lie Thr Thr Asp Asp Leu 
35 85 90 95 

Pro Pro Val Pro Val Leu Arg Gin Gly Asn Ala Leu Tyr Phe Tyr Ala 
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100 105 110 

Val Gly Pro Val Thr Trp Phe Tyr Asn Pro Ala Lys Thr Thr Met Glu 

115 120 125 

His Thr Val Asn Thr Tyr Ser Thr His Gly Tyr Tyr Phe Leu Ser Asp 

130 135 140 

Ala Ala Gly Ala Pro Leu Gin Met Ser Gin Tyr Thr Gly Gly Gly Ala 
145 150 155 160 

Ser Ala Glu Ala Leu lie Asp Tyr Tyr Asp Glu Leu Met Leu His Glu 

165 170 175 

Gin Glu Leu Tyr Ser Pro Lys Glu Ser Gly Arg Asp Leu Tyr Gly Glu 

180 185 190 

Ser Phe Ser Ala Val Asn Thr Arg Thr Val Lys Phe Pro Leu Arg Gly 

195 200 205 

Asn Thr Arg Ser Ser Gly Glu Leu Gly Thr Val Phe Ser Tyr lie Ala 

210 215 220 

Lys Ala Arg Ser Ala Gly Gly Gly Arg Glu Met Ser Leu Ser Ala Asn 
225 230 235 240 

Gly lie Leu lie Phe Ser Asp Pro Phe Ser Met Thr Ser Asn Glu Val 

245 250 255 

Ser Asn Ser Tyr Leu Ala Gly Lys Lys Arg Arg Leu Tyr His Ser Thr 

260 265 270 

Pro Met Asn Ser Leu Val Asn Glu Leu Arg Leu Asp Ala Asn Tyr Ser 

275 280 285 

Met Thr Gly Asp Ala Val Asn Leu Asp Phe lie Glu Val Ala Thr Gin 

290 295 300 

Asn Asp Leu Arg Tyr Asp Gly Ala Pro Met His lie Arg Arg Phe Ser 
305 310 315 320 

Asn Leu Pro Val Leu Gly Gly Glu Ser Cys Arg Phe Val lie Ser Glu 

325 330 335 

Val Pro Glu Ser Leu Val Val Leu Gin Ala Asn Ser Ser Leu Thr Ala 

340 345 350 

Ser Leu Val Pro Val Lys Thr Val Gly Asp Lys Thr lie Glu Phe Val 

355 360 365 

Ala Pro Pro Lys Gly Gin Asp Arg Arg Thr lie Asn Thr Phe Tyr Ala 

370 375 380 

Val Asp Leu Ser Gin Ala Ser Ala Pro Glu lie Leu Gly Ala Val Pro 



120 



385 390 395 400 

Asn Gin Asn Leu His Gly Glu Glu lie Pro Asp Leu lie lie Val Ser 

405 410 415 

Thr Gin Ala Leu Leu Leu Glu Ala Asp Arg Leu Ala Thr Tyr Arg Arg 
5 420 425 430 

Glu Lys Asn Gly Leu Lys Val Leu Val Val Leu Gin Glu Gin Val Phe 

435 440 445 

Asn Glu Phe Ser Gly Gly Thr Pro Asp Ala Thr Ala Tyr Arg Leu Phe 
450 455 460 

10 Ala Lys Met Phe Tyr Asp Arg Trp Lys Ala Asn Ala Pro Val Gly Glu 
465 470 475 480 

Thr Phe Pro Met Gin Met Leu Leu Phe Gly Asp Gly Ala His Asp Asn 

485 490 495 

Arg Lys Val Ser Val Ala Trp Gin Lys Pro Tyr Leu Gin Gin Thr Glu 
15 500 505 510 

Phe Leu Leu Thr Phe Gin Ala Val Asn Ser Thr Asn Val Asn Ser Tyr 

515 520 525 

Val Thr Asp Asp Tyr Phe Gly Leu Leu Asp Asp Gin Pro Ala Ser Val 
530 535 540 

20 Asn lie Gly Trp Arg Asn Tyr Asn Met Ala Val Gly Arg Phe Pro Val 
545 550 555 560 

Arg Thr Pro Ala Glu Ala Arg lie Ala Val Asp Lys Thr lie Arg Tyr 

565 570 575 

Glu Glu Asp Arg Glu Ser Gly Ala Trp Arg lie Arg Ala Cys Phe Ala 
25 580 585 590 

Ala Asp Asn Gly Asp Lys His Ala Thr Glu Thr Ser Arg Leu lie Asp 

595 600 605 

Thr Val Lys Arg Tyr Ala Pro Ala lie Met Pro Val Arg Ala Phe Gin 
610 615 620 

30 Asp Val Tyr Pro His Val lie Glu Asn Gly Leu His Ser lie Pro Gly 
625 630 635 640 

Ala Lys Lys Lys Met Leu Glu Thr Leu Gin Ser Gly lie lie Leu Leu 

645 650 655 

Asn Tyr Ala Gly His Gly Gly Pro Ala Gly Trp Ser Asp Glu His Leu 
35 660 665 670 

Leu Thr Leu Asn Asp lie His Lys Phe Asn Tyr Lys His Met Pro lie 



121 



675 

Trp lie Thr Ala 
690 

Ser Ala Gly Glu 
705 

Met Phe Ser Thr 

Asn Gly Phe Met 
740 

Tyr Arg Thr Met 
755 

Ser Thr Val Phe 
770 

Gly Asp Pro Ser 
785 

Thr Ala He Asn 

Leu Lys Ser Leu 
820 

Lys Gly Thr Phe 
835 

Phe Asp Gly Arg 
850 

Leu Ser Leu Val 
865 

Ala Glu Val Lys 

Asp Val Asn Tyr 
900 

Asn Glu Ser Thr 
915 

Val Gin Pro Gly 
930 

Glu He He Ser 
945 

Glu Val Asn Pro 



680 

Thr Cys Asp Phe 
695 

Glu Val Phe Leu 
710 

Thr Arg Val Val 
725 

Leu Arg Arg Met 

Gly Glu He He 
760 

Pro Asp Ser He 
775 

Val Arg Met Asn 
790 

Gly Gin Asp Pro 
805 

Glu Arg Val Ala 

Asp Glu Thr Phe 
840 

Lys Lys Met Thr 
855 

Tyr Tyr Asp Tyr 
870 

Asp Gly Leu Phe 
885 

Ser Glu His Glu 

Lys Ala Glu Ala 
920 

He Pro Asp Glu 
935 

Cys Phe Leu Asn 
950 

Thr Pro Leu Phe 



Ala Asn Tyr Asp 
700 

His Glu Lys Ser 
715 

Tyr Asn Thr Gin 
730 

Phe Glu Lys Ala 
745 

Arg Ser Ala Lys 

Asn Gin Leu Ser 
780 

Leu Pro Thr His 
795 

Glu Gly Gin Tyr 
810 

Leu Lys Gly Lys 
825 

Ser Gly Lys Val 

Ala Leu Glu Glu 
860 

Pro Asn Val Met 
875 

Glu Thr Ser Phe 
890 

Gly Arg He Asn 
905 

Met Gly Val Asp 

Val Thr Glu Asp 
940 

Asp Ser Thr Phe 
955 

Met Ala Glu Val 



685 

Ser Gin Thr Thr 

Gly Thr Pro He 
720 

Asn Glu Lys He 
735 

Lys Asp Gly Arg 
750 

Gin Gly Met Leu 
765 

Phe Phe Leu Met 

Lys Val Gin Leu 
800 

Gly Thr He Met 
815 

Val Thr Asp Glu 
830 

Phe Leu Thr Val 
845 

Glu Gly Asn Asp 

Tyr Ala Gly He 
880 

He Val Pro Lys 
895 

Leu Tyr Ala Tyr 
910 

Phe Ser He Arg 
925 

Asn Thr Pro Pro 

Arg Ser Gly Asp 
960 

Phe Asp Leu Asn 
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965 970 975 

Gly lie Asn He Thr Gly Ser Gly Val Gly His Asp He Thr Leu Cys 

980 985 990 

He Asp Gly Arg Ala Asp Leu Thr Tyr Asn Leu Asn Ala Tyr Phe Thr 
5 995 1000 1005 

Ser Ser Ala Thr Asp Ala Gly Val Gly Thr He Leu Phe Met He Pro 

1010 1015 1020 

Ala Leu Ala Glu Gly Asp His Thr Ala Arg Leu Thr Val Trp Asp He 
1025 1030 1035 1040 

10 Phe Asn Asn Ala Val His His Asp Phe Ser Phe Arg Val Val Asp Gly 

1045 1050 1055 

He Ala Pro Asp Val Ala Asp Val He Leu Phe Pro Asn Pro Val Arg 

1060 1065 1070 

Glu Ser Ala Thr Phe Arg He Phe His Asn Arg Pro Gly Ser Asp Leu 
15 1075 1080 1085 

Asn Val Ala Val Glu He Tyr Asp Phe Thr Gly Arg Leu Val Asn Ser 

1090 1095 1100 

Leu Pro Val Lys Thr Tyr Ser Ser Ser Tyr Gly Glu Pro He Glu He 
1105 1110 1115 1120 

20 Lys Trp Asp Leu Thr Ser Lys Tyr Gly Val Lys He Gly Asn Gly Phe 

1125 1130 1135 

Tyr Leu Tyr Arg Cys Val Val Asn Ser Pro Gly Gly Gin Thr Ala Ser 

1140 1145 1150 

Met Ala Lys Lys Met He Val Val Gly Gin 
25 1155 1160 

(2) INFORMATION FOR SEQ ID NO: 62 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 4 06 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



35 



(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: YES 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

5 (ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1 . . .406 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62 

10 

lie Phe lie Asp Pro Asp Lys Asn Thr Lys Gin Asn Glu Arg Asn Met 
15 10 15 

lie lie Lys Lys Met Leu Lys Asn Lys Leu Ala Pro Leu Ala lie Leu 
20 25 30 

15 Phe Leu Phe Ala Pro Lys Ala Met Lys Ala Gin Glu Gin Leu Asn Val 
35 40 45 

Val His Thr Ser Val Pro Ser Leu Asn lie Ser Pro Asp Ala Arg Ala 

50 55 60 

Ala Gly Met Gly Asp lie Gly Val Ala Thr Thr Pro Asp Ala Tyr Ser 
20 65 70 75 80 

Gin Tyr Trp Asn Pro Ser Lys Tyr Ala Phe Met Asp Thr Lys Ala Gly 

85 90 95 

lie Ser Phe Ser Tyr Thr Pro Trp Leu Ser Lys Leu Val Asn Asp lie 
100 105 110 

25 Ala Leu Met Gin Met Thr Gly Phe Tyr Lys Leu Gly Thr Asp Glu Asn 
115 120 125 

Gin Ala lie Ser Ala Ser Leu Arg Tyr Phe Thr Leu Gly Lys Leu Glu 

130 135 140 

Thr Phe Asp Glu Leu Gly Glu Ser Met Gly Glu Ala His Pro Asn Glu 
30 145 150 155 160 

Phe Ala Val Asp Leu Gly Tyr Ser Arg Gin Leu Ser Glu Asn Phe Ser 

165 170 175 

Met Ala Val Ala Leu Arg Tyr lie Arg Ser Asp Gin Ser Thr His Asn 
180 185 190 

35 Thr Gly Glu Asn Gin Ala Gly Asn Ala Phe Ala Ala Asp lie Ala Gly 
195 200 205 
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Tyr Leu Gin Lys Tyr Val Leu Leu Gly Asn Ala Glu Ser Leu Trp Ser 

210 215 220 

Leu Gly Phe Asn Val Lys Asn lie Gly Thr Lys lie Ser Tyr Asp Gly 
225 230 235 240 

5 Gly Val Thr Ser Phe Phe lie Pro Thr Ser Leu Asn Leu Gly Thr Gly 

245 250 255 

Leu Leu Tyr Pro lie Asp Asp Tyr Asn Ser lie Asn Phe Asn Leu Glu 

260 265 270 

Leu Ser Lys Leu Leu Val Pro Thr Pro Pro lie Met Asp Gin Asn Asp 
10 275 280 285 

Gin Ala Gly Tyr Glu Ala Ala Leu Lys Lys Tyr Gin Glu Thr Ser Ser 

290 295 300 

lie Ser Gly lie Phe Ser Ser Phe Gly Asp Ala Pro Gly Gly Leu Lys 
305 310 315 320 

15 Glu Glu Phe Arg Glu lie Thr Trp Gly Leu Gly Ala Glu Tyr Ser Tyr 

325 330 335 

Asp Asp Lys Phe Phe Val Arg Ala Gly Tyr Ser Tyr Leu His Pro Thr 

340 345 350 

Lys Gly Asn Leu Gin Tyr Phe Thr Ala Gly Ala Gly Phe Lys Met Asn 
20 355 360 365 

lie Phe Arg lie Asp Ala Ser Tyr Leu Leu Ser Thr lie Gin Ser Asn 

370 375 380 

Pro Leu Asp Gin Thr Leu Arg Phe Thr Leu Ala Phe Asp Met Asp Gly 
385 390 395 - 400 

25 Leu Arg Asn Leu Phe His 

405 

(2) INFORMATION FOR SEQ ID NO: 63 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 221 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE: protein 
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(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : mis c_f eature 

(B) LOCATION 1. . .221 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 

Thr Arg Glu Ser Val Leu His Cys Arg Thr Lys Leu Lys Lys Glu Arg 

15 10 15 

Lys Met Lys Lys Met He Leu Ala Ala Thr Met Leu Leu Ala Thr He 

20 25 30 

Gly Phe Ala Asn Ala Gin Ser Arg Pro Ala Leu Arg Leu Asp Ala Asn 

35 40 45 

Phe Val Gly Ser Asn Leu Met Gin Lys Val Ala Asn Thr Ser Val Asn 

50 55 60 

Asn Lys Met He Val Gly Leu Arg Val Gly Ala Ala Ala Glu Phe Ala 
65 70 75 80 

Leu Ser Asn Asp Gly Phe Tyr Leu Ala Pro Gly Leu Ala Tyr Thr Met 

85 90 95 

Arg Gly Ala Lys Met Glu Ser Leu Ser Glu Thr Thr Thr Arg Leu His 

100 105 HO 

Tyr Leu Gin He Pro Val Asn Ala Gly Met Arg Phe Ser Phe Ala Asp 

115 120 125 

Asn Met Ala He Ser Leu Glu Ala Gly Pro Tyr Phe Ala Tyr Gly Val 

130 135 140 

Ala Gly Thr He Lys Thr Lys Val Ala Gly Val Thr Ala Ser Val Asp 
145 150 155 160 

Ala Phe Gly Asp Asn Gly Tyr Asn Arg Phe Asp Leu Gly Leu Gly Leu 

165 170 175 

Ser Ala Ala Leu Ser Tyr Asp Arg Tyr Tyr Val Gin He Gly Tyr Glu 

180 185 190 

His Gly Leu Leu Asn Met Leu Lys Asp Ala Pro Asp Lys Thr Ser Leu 
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195 200 205 

Arg Asn His Asp Phe Phe Val Gly Leu Gly Val Arg Phe 
210 215 220 



(2) INFORMATION FOR SEQ ID NO: 64 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 67 amino acids 

(B) TYPE: amino acid 
{ D ) TOPOLOGY : linear 



(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1 ... 467 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 

Lys Gly Thr Trp Asn Arg Lys Asn Arg Arg Glu Glu Met Lys Arg Thr 

15 10 15 

lie Leu Leu Thr Ala Leu Thr Val Leu Ser Ser Leu Ser Leu Leu Arg 

20 25 30 

Ala Gin Asn Glu Ser Glu Ala Ser Thr Asn Pro Met Ser Gly Leu Ser 

35 40 45 

Leu Glu Asp Cys lie Arg lie Ala Lys Glu Arg Asn Leu Asn Leu Arg 

50 55 60 

Arg Gin Glu lie Glu Gin Glu Asn Arg lie lie Ser Leu Asp Ala Ala 
65 70 75 80 

Arg His Ser Phe Leu Pro Ser Val Asn Ala Gly lie Gly His Asn Tyr 
85 90 95 



127 



Ser Phe Gly Arg Ser Lys Asp Lys Thr Gly Val Thr Val Asp Arg Ser 

100 105 110 

Ser Met Asn Thr Asn Leu Ser lie Gly Ala Ser Val Glu Val Phe Ser 

115 120 125 

Gly Thr Arg Arg Leu His Asp Leu Lys Gin Gin Lys Tyr Asn Val Glu 

130 135 140 

Asp Gly lie Ala Arg Leu Gin Lys Ala Arg Glu Asp Leu Ser Leu Gin 
145 150 155 160 

lie Ala Ala Leu Tyr lie Asn Leu Leu Phe Arg Gin Glu Met Thr Arg 

165 170 175 

Thr Ala Glu Thr Gin Leu Ala Leu lie Arg Glu Gin Arg Asn Arg Thr 

180 185 190 

Ala Glu Met Val Arg Val Gly Lys Trp Ala Glu Gly Lys Leu Leu Asp 

195 200 205 

lie Asn Ala Gin Met Ala Lys Asp Glu Gin Leu Leu Val Gin Tyr Arg 

210 215 220 

Ser Glu Glu Glu Leu Ala Arg Leu Asp Leu Gly Gin Ala Leu Glu Leu 
225 230 235 240 

Glu His Pro Glu Ser lie Ala Val Lys Ala Pro Asp Thr Asp Val Leu 

245 250 255 

Val Ala Glu Arg Leu Gly Ser Leu Leu Ala Pro Glu Glu lie Tyr Arg 

260 265 270 

Thr Ala Leu Gly Leu Lys Pro Ala Leu His Ser Ser Glu Leu Gin lie 

275 280 285 ' 

Ala Ser Ala Arg Glu Gly Leu Ala Ser Ala Arg Ala Ala Tyr Phe Pro 

290 295 300 

Thr Leu Ser Leu Ser Ala Gly Tyr Ser Asn Gly Tyr Phe Arg Asp Leu 
305 310 315 320 

Gly Lys Glu Tyr Ala Ala lie Asn Pro Ser Phe Ser Glu Gin Trp Lys 

325 330 335 

Asn Asn Gly Ser Tyr Ser lie Gly Leu Ser Leu Asn lie Pro lie Phe 

340 345 350 

Ser Ala Met Gin Thr Gin Asp Arg Val Arg Ser Ser Arg Leu Gin lie 

355 360 365 

Arg Ser Ser Glu Leu Arg Leu Val Glu Glu Lys Lys Ala Leu Tyr Lys 
370 375 380 
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Glu lie Arg Gin Ala 
385 

Ala Ala Ala Glu Asn 
405 

Ala Arg Asp Ser Phe 
420 

Glu Ala Lys Thr Lys 
435 

Lys Tyr Asp Phe lie 

450 
Lys Asp Phe 
465 



Tyr Ser Asn Ala Val 
390 

Ser Lys Ala Ala Thr 
410 

Glu Ala Gly Arg Leu 
425 

Tyr Ala Leu Ser Gin 
440 

Tyr Lys Ala Lys Val 
455 



Ala Ala Asp Lys Ala lie 
395 400 
Leu Lys Ala Tyr Glu Tyr 
415 

Ser Ala Tyr Glu Tyr Ala 
430 

Val Glu Glu Leu Arg Ala 
445 

Leu Asp Phe Tyr Gin Gly 
460 



(2) INFORMATION FOR SEQ ID NO: 65 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 436 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



( iii ) HYPOTHETICAL : YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



(ix) FEATURE: 

(A) NAME/ KEY : mis c_f eature 

(B) LOCATION 1...436 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65 



Asn lie Met Tyr Lys Asp Tyr Lys Gly Leu Tyr Ala Ser Leu Arg Trp 

15 10 15 

Tyr Ala Leu He He Gly Leu Leu Phe Ala Ala Asp Gly He Gin Ala 
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20 

Gin Asn Asn Asn 
35 

Arg Leu Gly Glu 
50 

Gly Val Gly Leu 
65 

Ser Tyr Ser Ala 

Ser Thr Gly lie 
100 

Lys Met Gly Asn 
115 

Ser lie Ala Met 
130 

Gin Phe Gly Ser 
145 

Arg Lys Tyr Leu 

Gly Ala Thr Pro 
180 

Leu Phe Gly Arg 
195 

Ala Pro Tyr Asn 
210 

Lys Phe Asp Phe 
225 

Ser Leu Val lie 

Glu Leu Thr Gin 
260 

Ser Glu Thr Gin 
275 

His Thr Leu Gly 
290 

Leu Gly Ala Asp 



Phe Thr Glu Ser 
40 

Arg Thr Thr lie 
55 

Arg Gin Gly Thr 
70 

Val Asp Ser Met 
85 

Thr Trp Tyr Ala 

lie Glu Tyr Phe 
120 

Ser Ala Gly Val 
135 

Val Asp Gin Val 
150 

Gly Thr Gly Asn 
165 

Phe Lys Asn Phe 

Phe Thr His Ser 
200 

Pro Val His Leu 
215 

Gly Met Gin Tyr 
230 

Gly Ala Val Tyr 
245 

lie Lys Asn Gin 

Glu Tyr lie Lys 
280 

lie Gly Phe Ser 
295 

Val Gin Tyr Ser 



25 

Pro Tyr Thr Arg 

Ser Gly His Ser 
60 

Tyr Val Asn Ala 
75 

Thr Phe lie Phe 
90 

Glu Asn Gly Lys 
105 

Ala Met Leu Phe 

Leu Pro Tyr Ser 
140 

Glu Gly Gly Ser 
155 

Leu Asn Asp Leu 
170 

Ser lie Gly Ala 
185 

Arg Gin Val lie 

Ser Thr Leu Tyr 
220 

His Leu Leu Leu 
235 

Ser Pro Arg Val 
250 

Val Gin Asn Gly 
265 

Gly Met Asp Tyr 

Tyr Glu Lys Lys 
300 

Lys Trp Lys Gly 



30 

Phe Gly Leu Gly 
45 

Met Gly Gly Leu 

Val Asn Pro Ala 
80 

Asp Phe Gly Ala 
95 

Lys Asp Asn Arg 
110 

Pro lie Ser Lys 
125 

Ala Ser Gly Tyr 

Val Gin Tyr Thr 
160 

Tyr Val Gly lie 
175 

Asn Ala Ser Ser 
190 

Phe Ser Thr Glu 
205 

Leu Lys Ala Ala 

Lys Ser Asp Arg 
240 

Lys Met His Ser 
255 

Val Val Val Glu 
270 

Tyr Thr Leu Pro 
285 

Asp Lys Leu Leu 
Glu Lys Phe Tyr 



130 



305 310 315 320 

Lys Ser Asp Cys Lys Phe Gin Asp Arg lie Arg Val Ser Leu Gly Gly 

325 330 335 

Glu lie lie Pro Asp lie Asn Ala Val Gly Met Trp Pro Lys Val Arg 

340 345 350 

Tyr Arg Phe Gly Leu His Gly Glu Asn Ser Tyr Leu Lys Val Pro Thr 

355 360 365 

Lys Gly Gly Val Tyr Gin Gly Tyr His lie Val Gly Ala Val Phe Gly 

370 375 380 

lie Gly lie Pro Leu Asn Asp Arg Arg Ser Phe Val Asn Val Ser Leu 
385 390 395 400 

Glu Tyr Asp Arg Leu lie Pro Lys Glu Gly Met lie Lys Glu Asn Ala 

405 410 415 

Leu Lys Leu Thr Phe Gly Leu Thr Phe Asn Glu Ser Trp Phe Lys Lys 

420 425 430 

Leu Lys Leu Asn 
435 

(2) INFORMATION FOR SEQ ID NO: 66 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 94 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

( iii ) HYPOTHETICAL : YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : miscjeature 

(B) LOCATION 1. . .945 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66 

Asn Thr lie Ser Glu Asn Gly Asn Asp Ser Thr Tyr Phe Ser Phe Gin 

15 10 15 

Thr Phe Leu Met Arg Ser lie Tyr Gin Leu Leu Leu Ser lie Leu Leu 

20 25 30 

Ala Ser Leu Gly Phe Val Gly Leu Glu Ala Gin Gin Ala Gly Val Ala 

35 40 45 

Gly Arg Val Leu Asp Glu Glu Gly Asn Pro Met lie Gin Ala Asn Val 

50 55 60 

Gin Leu Val Gin Ser Thr Gly Gin Val Ala Val Ala Ala Gly Ala Thr 
65 70 75 80 

Asn Glu Lys Gly Leu Phe Ser Leu Lys Thr Ser Gin Glu Gly Asp Tyr 

85 90 95 

lie Leu Arg Val Ser Tyr Val Gly Tyr Thr Thr His Asp Glu Lys lie 

100 105 110 

Ser Leu Arg Asn Gly Gin Thr lie Thr Leu Lys Asp lie Ser Met Asn 

115 120 125 

Glu Asp Ala Arg Leu Leu Gin Ser Val Thr Val Gin Ala Lys Ala Ala 

130 135 140 

Glu Val Val Val Arg Asn Asp Thr Leu Glu Phe Asn Ala Gly Ser Tyr 
145 150 155 160 

Thr Val Ala Gin Gly Ala Ser lie Glu Glu Leu lie Lys Lys Leu Pro 

165 170 " 175 

Gly Ala Glu lie Gly Ser Asp Gly Lys lie Thr lie Asn Gly Lys Asp 

180 185 190 

lie Ser Lys lie Leu Val Asp Gly Lys Glu Phe Phe Ser Lys Asp Pro 

195 200 205 

Gin Val Ala lie Lys Asn Leu Pro Ala Asp Met Val Asn Lys Val Gin 

210 215 220 

Val Leu Asn Lys Leu Ser Glu Leu Ser Arg Met Ser Gly Phe Asp Asp 
225 230 235 240 

Gly Glu Glu Glu Thr Val lie Asn Leu Thr Val Lys Pro Glu Lys Lys 

245 250 255 

Lys Gly Leu Phe Gly Thr Leu Gin Ala Gly Tyr Gly Thr Asp Gin Arg 
260 265 270 
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Tyr Met Ala 
275 

Thr Leu lie 
290 

5 Met Asp Ser 
305 

Gly Arg Arg 
Leu Gly Gly 

10 

Gly Asp Ala 
355 

Arg Val Glu 
370 

15 lie Leu Glu 
385 

Gin Trp Lys 

Ser lie Ser 

20 

Asp Ala Thr 
435 

Gin Gly Asn 
450 

25 Leu Asn Asp 
465 

Thr Asp Glu 
Glu Thr Asn 

30 

Leu Arg Leu 
515 

Ala lie Leu 
530 

35 Tyr Arg Leu 
545 



Gly Gly Asn Val 

Gly Ser Ala Asn 
295 

Glu Met Gly Ser 
310 

Gly Phe Gly Asn 
325 

Asn Phe Ser Val 
340 

Arg Tyr Gly Tyr 

Asn lie Leu Ala 
375 

Arg Ser Phe Ser 
390 

Pro Ser Glu Arg 

405 

Lys lie Asp Gly 
420 

Gly lie Ser lie 

Asn Phe Arg Leu 
455 

Glu Gly Arg Thr 
470 

Asp Gly Asp Gly 
485 

Gin Lys Gin Phe 
500 

Ser Tyr Val Glu 

Asn Arg Arg Phe 
535 

Gly Asp Asp Gly 
550 



Asn Arg Phe Asp 

280 

Asn Thr Asn Asn 

Met Thr Phe Phe 
315 

Ser Gly Gly Val 
330 

Glu Phe Ser Ser 
345 

Asn Asp Lys Ala 
360 

Glu Gly Asn Thr 

His Asn Gly Gin 

395 

Thr Glu Val Val 
410 

Phe Phe Asn Asp 
425 

Asn Lys Gly Ser 
440 

Asn Gly Glu Leu 

lie Ser Ala Ser 
475 

lie Tyr Gin Ala 
490 

Asn Asp Asn Ser 
505 

Pro Leu Gly Lys 
520 

Ser Arg Arg Asn 

Gin Tyr Ser lie 
555 



Gly Asn Lys Gin Trp 
285 

Met Gly Phe Ser Glu 
300 

Ser Pro Gin Gly Gly 
320 

Thr Ser Ser Ser Met 
335 

Ala Leu Asn Thr Gly 
350 

lie Glu Thr Thr Lys 
365 

Tyr Met Asp Glu Asn 
380 

Ala Arg Phe Arg Met 
400 

Phe Glu Pro Asp Leu 

415 

Thr Tyr Glu Thr Lys 
430 

lie His Gin Thr Thr 
445 

Asp lie Ser His Lys 
460 

Val Ser Gly Gly Leu 
480 

Val Leu Gin Ser Val 
495 

Asn Leu Gin Tyr Arg 
510 

Asn Tyr Phe Ala Gin 
525 

Ser Asp Arg Glu Val 
540 

Leu Asp Ser Gin Tyr 
560 



133 



Gly Leu Ser Tyr Ser Asn Glu Phe Thr Gin Tyr Arg lie Gly Leu Asn 

565 570 575 

Leu Lys Lys lie Ala Lys Thr Trp Asp Tyr Thr Val Gly Phe Asn Val 
580 585 590 

5 Asp Pro Asn Arg Thr Val Ser Tyr Arg Ser Val Ala Gly Val Glu Gin 
595 600 605 

Asp Lys Leu Ala Phe Asn Arg Val Asn Leu Ser Pro Met Leu Arg lie 

610 615 620 

Asn Tyr Lys Pro Ser Arg Thr Thr Asn Leu Arg Val Asp Tyr Arg Gly 
10 625 630 635 640 

Arg Thr Thr Gin Pro Ser lie Asn Gin lie Ala Pro Val Gin Asp lie 

645 650 655 

Thr Asn Pro Leu Phe Val Thr Glu Gly Asn Pro Gly Leu Lys Pro Ser 
660 665 670 

15 Tyr Ser Asn Asn Val Met Ala Met Phe Ser Asp Phe Asp Ala Lys Ser 
675 680 685 

Gin Arg Ala Phe Asn lie Val Phe Phe Gly Asn Tyr Thr Phe Asp Asp 

690 695 700 

lie Val Pro Asn Thr His Tyr Asp Pro Ser Thr Gly lie Arg Thr Thr 
20 705 . 710 715 720 

Arg Tyr Glu Asn Ala Ser Gly Thr Trp Gin Ala Asn Leu His Gly Thr 

725 730 735 

Leu Ser Leu Pro Leu Lys Asn Arg Ala Phe Ser Phe Arg Met Ser Leu 
740 745 7-50 

25 Phe Asn Arg Leu Ala Glu Gly Gin Ser Phe lie Asn Asp Asp Lys Asn 
755 760 765 

Lys Ala Leu Ser Phe Arg Thr Arg Glu Arg Leu Thr Leu Thr Tyr Arg 

770 775 780 

Asn Asn Trp lie Asp Thr Ser lie Gly Gly Asn lie Gly Phe Tyr Met 
30 785 790 795 800 

Ala Asn Asn Ser Leu Ser Gly Gin Lys Asp Ser Arg Thr Tyr Asp Phe 

805 810 815 

Gly Gly Asn Tyr Gin Val Ala Leu Thr Leu Pro Tyr Gly Phe Arg lie 
820 825 830 

35 Asp Ser Asp Val Glu Tyr Asn Thr Asn Ser Gly Tyr Ser Gly Gly Phe 
835 840 845 
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Ser Leu Asp Glu Trp Leu Trp Asn Ala Ser Leu Ser Tyr Ser Phe Leu 

850 855 860 

Arg Asp Lys Ala Gly Thr Leu Arg Val Asn Gly Tyr Asp lie Leu Gly 
865 870 875 880 

Gin Arg Ser Ser lie Ser Arg Ser Ala Ser Ala lie Asn lie Glu Glu 

885 890 895 

Ser Met Ser Asn Thr lie Gly Arg Tyr Val Met Val Asp Phe lie Tyr 

900 905 910 

Arg Phe Asn Ala Phe Ser Gly Gly Gly Ser Arg Ser Asp His Gin Arg 

915 920 925 

Gly Asn Met Asn Arg Pro Gly Pro Pro Phe Gly Gly Gly Arg Arg Pro 
930 935 940 

Ser 
945 

(2) INFORMATION FOR SEQ ID NO: 67 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 790 amino acids 

(B) TYPE: amino acid 
( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1. . .790 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67 
Leu Leu Phe Ser Ser Pro Leu Pro Arg Met Asp Arg Pro Lys Pro Ser 
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15 10 15 

Tyr lie Val Arg lie Ala Ala lie Leu Cys Leu Phe Val Gly Arg Pro 

20 25 30 

Leu Phe Ala Gin Ser Tyr Val Asp Tyr Val Asp Pro Leu lie Gly Thr 
5 35 40 45 

Leu Ser Ser Phe Glu Leu Ser Ala Gly Asn Thr Tyr Pro Val lie Gly 

50 55 60 

Leu Pro Trp Gly Met Asn Ser Trp Thr Pro Met Thr Gly Val Pro Gly 
65 70 75 80 

10 Asp Gly Trp Gin Tyr Thr Tyr Ser Ala His Lys lie Arg Gly Phe Lys 

85 90 95 

Gin Thr His Gin Pro Ser Pro Trp lie Asn Asp Tyr Gly Gin Phe Ser 

100 105 110 

Leu Leu Pro Leu Thr Ala Pro Gin Lys Pro Ser Ser Asn Asp Ser lie 
15 115 120 125 

Ala Leu Thr Lys Trp Cys Lys Gin Leu Phe Ser Asp Glu Gin Thr Ser 

130 135 140 

Trp Phe Ser His Lys Ala Glu Thr Ala Thr Pro Tyr Tyr Tyr Ser Val 
145 150 155 160 

20 Tyr Leu Ala Asp Tyr Asp Thr Arg Val Glu Met Ala Pro Thr Glu Arg 

165 170 175 

Ala Ala lie Phe Arg lie Arg Tyr Ser Gly Asn Thr Glu Ser Gly Ser 

180 185 190 

Gly Arg Trp Leu Arg Leu Asp Ala Phe Thr Gly Gly Ser Glu lie Ser 
25 195 200 205 

lie Val Asp Pro His Thr Val Val Gly lie Ser Arg Lys Asn Ser Gly 

210 215 220 

Gly Val Pro Ala Asn Phe Ala Cys Tyr Phe lie Leu Gin Ser Asp Thr 
225 230 235 240 

30 Pro Met Ala Asp Val Leu Leu Glu Thr Asp Thr Gly Lys Ser Asp Glu 

245 250 255 

Gly Thr Arg Ala Trp Ala Ala Cys Arg Phe Asp Ser Gin Glu Val Thr 

260 265 270 

Val Arg Val Ala Ser Ser Phe lie Ser Val Glu Gin Ala Glu Arg Asn 
35 275 280 285 

Leu Ala Glu Val Lys Gly Gin Ser Phe Asp Arg lie Arg Leu Ala Gly 
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290 295 300 

Arg Glu Ala Trp Asn Lys Val Leu Gly Arg lie His Val Glu Gly Gly 
305 310 315 320 

Thr Lys Asp Glu Arg Thr Thr Phe Tyr Ser Ala Leu Tyr Arg Cys Leu 

325 330 335 

Leu Phe Pro Arg Arg Phe Tyr Glu Glu Asp Ala Ser Gly Asn Phe Val 

340 345 350 

His Tyr Ser Pro Tyr Asn Gly Glu Val Leu Pro Gly Tyr Leu Tyr Thr 

355 360 365 

Asp Thr Gly Phe Trp Asp Thr Phe Arg Ala Leu Phe Pro Leu Leu Asn 

370 375 380 

Leu Leu Tyr Pro Asp Glu Asn lie Lys lie Gin Glu Gly Leu Leu Asn 
385 390 395 400 

Val Tyr Arg Glu Ser Gly Phe Phe Pro Glu Trp Ala Ser Pro Gly His 

405 410 415 

Arg Asp Cys Met lie Gly Asn Asn Ser Ala Ser Val Leu Ala Asp Ala 

420 425 430 

Tyr Leu Lys Gly Val Arg Val Glu Asp Thr Arg Thr Leu Met Asn Gly 

435 440 445 

Leu Leu His Ala Thr Lys Ala Val His Pro Lys lie Ser Ser Thr Gly 

450 455 460 

Arg Lys Gly Trp Glu Trp Tyr Asn Ser Leu Gly Tyr Val Pro Ala Asp 
465 470 475 480 

Ala Gly lie Asp Glu Ser Ala Ala Arg Thr Leu Glu Tyr Ala Tyr Asn 

485 490 495 

Asp Trp Cys lie Leu Arg Leu Gly Arg Thr Leu Gly Trp Asp Arg Ala 

500 505 510 

Ala Leu Asp Thr Leu Ala His Arg Ser Met Asn Tyr Arg His Leu Phe 

515 520 525 

Asp Pro Glu Thr Lys Leu Met Arg Gly Arg Asn Gin Asp Gly Ser Phe 

530 535 540 

Arg Thr Pro Phe Ser Pro Phe Lys Trp Gly Asp Val Phe Thr Glu Gly 
545 550 555 560 

Asn Ala Trp His Tyr Thr Trp Ser Val Phe His Asp Val Gin Gly Leu 

565 570 575 

He Asp Leu Met Gly Gly Asp Arg Pro Phe Val Ser Met Leu Asp Ser 
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580 

Val Phe Asn Thr 
595 

lie His Glu lie 
610 

His Gly Asn Gin 
625 

Gly His Pro Trp 

Leu Tyr Arg Pro 
660 

Gin Thr Ser Ala 
675 

Thr Pro Ala Thr 
690 

Val lie Leu Ser 
705 

Ala Asn Ser Ala 

Lys Glu Trp Ser 
740 

Ala Ser lie Gin 
755 

Met Lys Glu Ser 
770 

Arg Ala Asn His 
785 



Pro Pro Met Phe 
600 

Arg Glu Met Gin 
615 

Pro lie Gin His 
630 

Lys Ala Gin Glu 
645 

Thr Pro Asp Gly 

Trp Tyr Val Phe 
680 

Asp Gin Tyr Val 
695 

Phe Pro Asp Gly 
710 

Asp Thr Pro Tyr 
725 

Cys Asn Tyr Leu 

Trp Met Met Asp 
760 

Asp Arg Pro Tyr 
775 

Ser Asn 
790 



585 

Asp Glu Ser Tyr 

lie Ala Asp Met 
620 

Met lie Tyr Leu 
635 

Arg Leu Arg Glu 
650 

Tyr Cys Gly Asp 
665 

Ser Ala Leu Gly 

Leu Gly Ser Pro 
700 

His Lys Thr Val 
715 

He Arg Ser He 
730 

Thr His Glu Gin 
745 

Thr Lys Pro Asn 

Ser Phe Ser Thr 
780 



590 

Tyr Gly Phe Val 
605 

Gly Asn Tyr Ala 

Tyr Asn His Ala 
640 

Val Met Gly Arg 
655 

Glu Asp Asn Gly 
670 

Phe Tyr Pro Val 
685 

He Phe Ser Lys 

Leu His Ala Pro 
720 

Ser Val Glu Gly 
735 

Leu Arg Ser Ser 
750 

Tyr Asn Arg Gly 
765 

Glu Gin Gin Arg 



(2) INFORMATION FOR SEQ ID NO: 68 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 52 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1. . .452 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68 

Leu lie lie Glu Lys Glu Met Lys Thr Thr Val Gin Gin lie lie Leu 

15 10 15 

Cys Leu Ala Leu Met Met Ser Gly Val Leu Gly Gly Asn Ala Gin Ser 

20 25 30 

Phe Trp Glu Glu lie Ala Pro Pro Phe lie Ser Asn Glu Pro Asn Val 

35 40 45 

Lys Tyr lie lie Pro Asn Met Gly lie Asp Ser Lys Gly Thr lie Tyr 

50 55 60 

Val Thr Val Thr Lys Arg lie Gin Gin Gly Ala Asn Tyr Thr Ser Glu 
65 70 75 80 

Gin Leu Gly Met Tyr Tyr Arg Pro Leu Gly Asp Asn Glu Gin Trp Trp 

85 90 - 95 

Lys His Asp Pro Tyr Phe Asp Asp Lys lie Val Ala Asp lie Gin Thr 

100 105 110 

Asp Ala Tyr Gly Arg Val Tyr Val Cys Thr Thr Ser Ser Arg Asp Gin 

115 120 125 

Glu Tyr Gin Leu Tyr lie Asn Glu Gin Asn Glu Trp Arg Cys lie Phe 

130 135 140 

Lys Thr Ser Val Ser Thr Tyr Glu His Gly Met Ala Val Phe Arg Ser 
145 150 155 160 

Ser Thr Gly Val Thr Tyr lie Gly Thr Arg His His He Phe Ala Ser 

165 170 175 

Gly Val Asn Asp Phe Glu Phe Asn Thr He Tyr Glu Asp Ser Thr Pro 
180 185 190 
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Met Ser Cys Arg Phe Ala Glu Ala Thr Asn Ser Gly Thr lie Tyr Leu 

195 200 205 

Ala Leu Met His Glu Thr Thr Met Ser Thr Thr lie Leu Thr Tyr Gin 

210 215 220 

Asn Gly Glu Phe Val Asp lie Ser Glu Ser Glu Leu Ser Asn Ser lie 
225 230 235 240 

lie Ala Ser Met Cys Ser Asn Lys Glu Gly Asp lie lie Ala Leu Val 

245 250 255 

Thr Ser Tyr Thr Gly Phe Met Ser Gly Thr Leu Ala lie Arg Lys Ala 

260 265 270 

Asp Glu Gly Lys Trp Gin Leu Val Gly Gly Asp lie Gin Asn Ala lie 

275 280 285 

Val Gin Asn lie Cys Met Met Asp Asp Asn Lys lie Ala Cys Glu Val 

290 295 300 

Phe Gly Thr Pro Asn Gly Val Asp Gly Arg Thr Arg Val Cys Val Ser 
305 310 315 320 

Asp Ala Ser Val Phe Asp Phe Glu Trp Tyr Glu Asp Glu lie Tyr Gly 

325 330 335 

Gly Leu lie Phe Asp Thr Phe Phe Tyr Ser Pro Trp Asp Lys Leu Leu 

340 345 350 

Tyr Ala Lys Phe Gly Gly lie Met Leu Arg Ser Lys Glu Ser Phe lie 

355 360 365 

Thr Ser Phe lie Ser Pro Thr Val Val Gin Gly Val Asp Val Tyr Thr 

370 375 380 - 

Leu Ala Gly Lys lie Arg lie Glu Ser Glu Thr Pro Val Ser Glu Val 
385 390 395 400 

Leu Leu Phe Asp Leu Ala Gly Arg Met Val Leu Arg Gin Thr lie Asp 

405 410 415 

Asn Lys lie Tyr Ser Asp lie Asp Thr Asn Gly Leu Lys Arg Ser Gly 

420 425 430 

lie Tyr Val Val Ser Val Arg Leu Ser Ser Gly Gin Val Phe Ser His 

435 440 445 

Lys Val Gin Val 
450 



(2) INFORMATION FOR SEQ ID NO: 69 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 331 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



<iii) HYPOTHETICAL: YES 

10 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



(ix) FEATURE: 
15 (A) NAME/ KEY : misc_feature 

(B) LOCATION 1 . . . 331 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69 

20 Gly Leu Tyr Gly Met Ser Val Val Pro lie lie lie Tyr Leu Cys Gly 
15 10 15 

lie Ser Asn Tyr Ala Arg Leu Met lie lie Arg Cys Leu lie Arg Arg 

20 25 30 

Pro Arg Thr Val Leu Phe Gly Leu lie Phe Val Val Gly Leu Phe Ser 
25 35 40 45 

Ala Met Ala Gin Glu Lys Lys Asp Ser Leu Ser Thr Val Gin Pro Val 

50 55 60 

Pro Asn Ser Ser Met Val Glu Gin Thr Pro Leu Leu Ser lie Asp His 
65 70 75 80 

30 Pro Val Leu Pro Ala Ser Phe Gin Asn Thr Arg Thr Leu Lys Arg Phe 

85 90 95 

Arg Asp Lys His Leu Ser Asp Ala Leu Leu Asn Gly Leu Lys Pro His 

100 105 110 

Arg Ser Ser Leu Gin Leu Asn Glu Glu Leu Asn Phe Ala Ala Glu Arg 
35 115 120 125 

Arg Asp Phe Val Ser Pro Leu Leu Gin Thr Arg His Ala Ala Gly Val 
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130 135 140 

Leu Ser Trp Arg Pro Thr Asp Arg Met His Phe Tyr Thr Ser Gly Asn 
145 150 155 160 

lie Gly Leu Gly His Asp Leu Leu Thr Gly Val Arg Lys Asp Phe Gly 
5 165 170 175 

Trp Asn Ala Gly Ala Asp Phe Leu Leu Ser Gin Asn Leu Thr Ala His 

180 185 190 

Val Gin Gly Gly Trp Gin Gin Asn Phe Gly Phe lie Pro Met Thr Ala 
195 200 205 

10 Val Asn Gly Gin Leu Arg Trp Gin Ala Thr Glu Arg Leu Ser Phe Thr 
210 215 220 

Thr Gly lie Asp Tyr Arg Gin Val Gin Trp Asn Ala Phe Asp Asn Arg 
225 230 235 240 

Thr Phe Ser Leu Lys Gly Ser Ala Arg Tyr Glu Val Met Asp Asn Val 
15 245 250 255 

Phe Val Asn Gly Phe Gly Ser Tyr Pro Leu Tyr Ser Ser Thr Arg Ser 

260 265 270 

Gly Leu Asn Met Ala Val Pro Met His Gly Phe Gly Pro Gin Tyr Gly 
275 280 285 

20 Gly Ser Leu Glu Leu Lys Val Ser Glu Arg Phe Gly Phe Ala Val Gly 
290 295 300 

Met Glu Arg Glu Tyr Asn lie Trp Thr Arg Arg Trp Glu Thr His Tyr 
305 310 315 320 

Phe Ala Tyr Pro Val Phe Tyr Gly Asp Lys Lys 
25 325 330 

(2) INFORMATION FOR SEQ ID NO: 70 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 24 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



35 



<ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: YES 



4 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

5 (ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1. . .248 



10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70 



lie Lys Arg lie Glu Met Lys Arg lie Phe Thr Val Ala Leu Val Leu 
15 10 15 

Leu Ala Ser Val Thr Met Ala lie Gly Gin Ser Arg Pro Ala Leu Arg 
20 25 30 

15 Val Asp Ala Asn Phe Val Gly Ser Asn Gin Ser Met Lys Arg Asp Gly 
35 40 45 

Tyr Val Trp Asp Thr Lys Met Asn Val Gly Leu Arg Val Gly Ala Ala 

50 55 60 

Ala Glu Phe Met lie Gly Ser Arg Gly Phe Tyr Leu Ala Pro Gly Leu 
20 65 70 75 80 

Asn Tyr Thr Met Lys Gly Ser Lys Thr Glu Trp Asp lie Pro Glu Met 

85 90 95 

Val Pro Gly Thr Tyr lie Thr Met Val Ser Thr Arg Leu His Tyr Leu 
100 105 110 

25 Gin Leu Pro lie Asn Ala Gly Met Arg Phe Asp Leu Met Asn Asp Met 
115 120 125 

Ala Val Ser lie Glu Ala Gly Pro Phe Leu Ala Tyr Gly lie Tyr Gly 

130 135 140 

Thr Tyr Arg Gin Lys Leu Glu Gly Trp Lys Pro Asn Asn Tyr Ser Thr 
30 145 150 155 160 

Glu Phe Phe Gly Pro Thr Leu Gly Gly Pro Thr Asn lie Arg Trp Asp 

165 170 175 

lie Gly Ala Asn lie lie Ala Ala Phe His Tyr Lys Arg Tyr Tyr lie 
180 185 190 

35 Gin lie Gly Tyr Glu His Gly Phe Val Asp lie Val Ser Gly Gly Gly 
195 200 205 
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Ser Asp lie Pro Arg Leu Asn Asp Asn Arg Gin Ser Ser Ser Thr Thr 

210 215 220 

Ala Leu Arg Glu Lys Gly Asn Asn Glu Tyr Ala Tyr Asn Arg Asp Phe 
225 230 235 240 

Phe Val Gly lie Gly Tyr Arg Phe 
245 

(2) INFORMATION FOR SEQ ID NO: 71 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 563 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE": 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1...563 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71 

Ser Leu lie Asn Asn Ser Arg Asp Trp Arg Ala Pro Val Arg Leu Ser 

15 10 15 

Leu Lys Lys Lys Thr Lys Thr Met Lys Thr Lys Val Leu Arg Lys Phe 

20 25 30 

Val Val Ala Ala Phe Ala Val Ala Thr Leu Cys Pro Leu Ala Gin Ala 

35 40 45 

Gin Thr Met Gly Gly Asp Asp Val Lys Val Val Gin Tyr Asn Gin Glu 

50 55 60 

Lys Leu Val Gin Thr Arg Met Ser Val Ala Asp Asn Gly Trp He Tyr 
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65 70 75 80 

Val Met Thr His Ser Gly Tyr Asp Thr Gly Asn Ser Asn Val Lys lie 

85 90 95 

Phe Arg Ser Lys Asp Gin Gly Ala Thr Tyr Gin Lys Leu Arg Asp Trp 
5 100 105 110 

Asp Pro Ser Asp Asp Tyr Gin Phe Gin Asp Phe Asp lie Val Val Thr 

115 120 125 

Gly Lys Asn Glu Ser Asp lie Lys lie Trp Ser Val Glu Leu Met Asn 
130 135 140 

10 Lys Pro Gly Gly Tyr Lys Ser Arg Val Ala Val Phe Ser Arg Asp Ala 
145 150 155 160 

Asn Ala Gin Asn Ala Lys Leu Val Tyr Lys Glu Asp Phe Ser Asn Val 

165 170 175 

Gin Leu Tyr Asp Val Asp lie Ala Ser Asn Tyr Arg Ser Pro Ser Ser 
15 180 185 190 

Leu Asn Asn Gly Gly Asn Pro Phe Ala Leu Ala Phe Ala Tyr Thr Gly 

195 200 205 

Phe Asn Asn Thr His Lys lie Ser Phe Val Asp Tyr Val Phe Ser Leu 
210 215 220 

20 Asn Gly Gly Gin Asn Phe Asn Lys Asn Leu Leu Phe Ser Gin Asp Gly 
225 230 235 240 

Glu Lys Lys lie Asp Lys Val Asp Leu Ser Leu Gly Ser Thr Ser Glu 

245 250 255 

Ser Met Gly His Asn Ala Trp Pro Leu Met Gly Val Val Phe Glu Met 
25 260 265 270 

Asn Lys Gin Gly Gly Lys Ser Asp lie Gly Phe Leu Ser Asn Phe Val 

275 280 285 

Asp Asn Asp Pro Glu Phe Gin Trp Ser Gly Pro lie Lys Val Ser Glu 
290 295 300 

30 Ser Asp Met Ser Phe Ser Pro Lys lie Gin Met Leu Leu Asp Glu Asp 
305 310 315 320 

Asn Asn Thr lie Asn Gly Glu Ser Cys His Asn Phe Met lie Thr Tyr 

325 330 335 

Ser Asp Tyr Asp Ser Glu Tyr Ser Asp Trp Asp lie Arg Tyr Val Tyr 
35 340 345 ' 350 

Pro Lys Lys Ser Phe Lys Tyr Glu Lys Gly Lys Thr Pro Thr Met Asp 
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355 360 365 

Asp Leu Val Glu Ala Phe Leu Thr Ala Ser Tyr Gin Ser Glu Thr Asn 

370 375 380 

Ser Gly Leu Gly Tyr Asp Lys Asn Ala Asn His Tyr Leu lie Thr Tyr 
385 390 395 400 

Ala Lys Lys Glu Glu Asn Gly Thr Asn Thr Leu Lys Tyr Arg Trp Ala 

405 410 415 

Asn Tyr Asp Lys lie His Asn Lys Asp Leu Trp Ser Asp Thr Phe Thr 

420 425 430 

Tyr Thr Ser Ser Ala Asn Ala Leu Tyr Thr Pro Gin Val Asp lie Asn 

435 440 445 

Pro Thr Lys Gly Leu Val Cys Trp Ser Trp Val Glu Tyr Leu Pro Gly 

450 455 460 

Lys Arg lie Val Trp Ser Asp Thr Gin Trp Thr His Ala Asn Gly Val 
465 470 475 480 

Glu Asp lie Val Met Gin Glu Gly Ser Met Lys Leu Tyr Pro Asn Pro 

485 490 495 

Ala Gin Glu Tyr Ala Val lie Ser Leu Pro Thr Ala Ala Asn Cys Lys 

500 505 510 

Ala Val Val Tyr Asp Met Gin Gly Arg Val Val Ala Glu Ala Ser Phe 

515 520 525 

Ser Gly Asn Glu Tyr Arg Leu Asn Val Gin His Leu Ala Lys Gly Thr 

530 535 540 

Tyr He Leu Lys Val Val Ser Asp Thr Glu Arg Phe Val Giu Lys Leu 
545 550 555 560 

He Val Glu 



(2) INFORMATION FOR SEQ ID NO: 72 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 378 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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10 



( iii ) HYPOTHETICAL : YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1 ... 378 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72 



Gly He He Ser Val Leu Ser His Val Val Asp Arg Pro Gin Trp Gly 
15 10 15 

15 Ala Ser Pro Glu Ala Ala Gly Thr His Ser Val Tyr Ser He Leu His 
20 25 30 

Pro Ser Ala Gly He He Arg He Arg Ser Met Gly He He Ser Ala 

35 40 45 

Cys Arg He Ala He Leu Ala Gly Lys His Pro Arg Ser Gly Leu Ser 
20 50 55 60 

Arg Ala Asn Val Gly He Leu Ser Tyr Asn Pro Glu Asn Thr Pro Glu 
65 70 75 80 

Lys Lys Arg Lys Leu Gin Glu Lys Asn Val Phe Leu Gin He Arg Leu 
85 90 95 

25 Arg Gin Ser Phe Asn Asn Leu He Pro Ser Leu Pro Phe Arg He Asp 
100 105 HO 

Asn Thr Lys Lys He Thr Glu Met Lys Lys Thr Thr Leu Thr Gly Ser 

115 120 125 

He Cys Ala Leu Leu Leu Phe Leu Gly Leu Ser Ala Asn Ala Gin Ser 
30 130 135 140 

Lys Leu Lys He Lys Ser He Glu Ala Ala Thr Thr Phe Ser Ser Ala 
145 150 155 160 

Thr Ala Gly Asn Gly Phe Gly Gly Asn He Phe Gly Met Asp Met Ser 
165 170 175 

35 He Arg Met Arg Val His His Ser He Leu Pro Glu Gly Leu Asp Phe 
180 185 190 
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Ser Val Gly lie 
195 

Pro Lys Leu Met 
210 

Lys Val lie Val 
225 

Leu Gly Arg Phe 

Gly Ala Tyr Trp 
260 

lie Pro Lys Ser 

275 

Asp Leu Val Leu 
290 

Ser Pro Gly lie 
305 

Lys Ser Lys Tyr 

Leu Asn Leu Leu 
340 

Pro Leu Ser Ala 
355 

Phe Asn Phe Thr 
370 



His Glu Arg Arg 
200 

Tyr Thr Asn Val 
215 

Phe Glu Asp Ala 
230 

Leu lie Ser Leu 
245 

Gly Trp Thr Asn 

Lys Val His Tyr 
280 

Gin Ser Glu Asp 
295 

Gly Leu Lys Ser 
310 

Asp Phe Arg Leu 
325 

Tyr Pro Tyr Arg 

Leu Ser Pro Arg 
360 

Val Gly Leu Trp 
375 



Ala His Trp Glu 

Pro Ser lie lie 
220 

Glu Asp Phe Phe 
235 

Gly lie Ser Tyr 
250 

Asp Ala His lie 
265 

Asp Thr Tyr Thr 

Val Ala Thr Val 
300 

Ser lie Trp Trp 
315 

Gly Phe Ser Leu 
330 

Asn Phe Lys Leu 
345 

Met Asn His lie 
Thr Asn 



Glu Ala Gly Ser 
205 

Gly lie Val Glu 

Asp Lys Lys Ala 
240 

Thr Lys His Leu 
255 

Leu Phe Ser Pro 
270 

Arg Ala Gly Ser 
285 

Ser Asn Gly Phe 

Lys Met Pro lie 
320 

Gly Tyr Glu Tyr 
335 

Asp Gly Asn Lys 
350 

Gly His Val Gly 
365 



(2) INFORMATION FOR SEQ ID NO: 73 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 286 amino acids 

(B) TYPE: amino acid 
( D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: YES 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : mis cofeature 

(B) LOCATION 1. . .286 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 

He Cys Gly Ser Lys Met Asn Leu Ser Gly Leu Gin Ser Phe Thr Met 

15 10 15 

Met Lys Ser Met Arg Ser Val Leu Leu Leu Leu Phe Pro Leu Ser Leu 

20 25 30 

He Thr Ala Leu Gly Cys Ser Asn Asn Lys Ala Ala Glu Ser Lys Ser 

35 40 45 

Val Ser Phe Asp Ser Ala Tyr Leu Glu Arg Tyr lie Pro Leu Arg Ala 

50 55 60 

Asp He Asp Thr Pro Ser Leu His Val Met He Ser Tyr Val Tyr Pro 
65 70 75 80 

Ser Gly Asp Asp Met Leu Thr Glu He Phe Asn Gly Leu Leu Phe Gly 

85 90 95 

Asp Ser Leu Met Asp Ser Ser Ser Pro Glu Asn Ala Met Glu Gly Tyr 

100 105 110 

Ala Gin Met Leu Gly Glu Asp Tyr Arg Ser Asn Asn Ala Glu Ala Asn 

115 120 125 

Leu Gin Gly Leu Pro Ser Asp Leu Leu Asp Tyr He Tyr Lys Gin Glu 

130 135 140 

Asn Thr He Ala Tyr Cys Asp Thr Gly Leu He Ser Thr Arg He Asn 
145 150 155 160 

Thr Tyr Thr Tyr Glu Gly Gly Ala His Thr Glu Asn Thr Val Arg Phe 

165 170 175 

Ala Asn He Leu Arg Thr Thr Gly Lys Val Leu Glu Glu Arg Asp He 

180 185 190 

Phe Lys He Asp Tyr Ala Glu Arg Leu Ser Ala Leu He He Gly Gin 

195 200 205 

Leu Val His Asp Phe Gly Lys Thr Thr Pro Ala Glu Leu Asp Ala He 
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210 215 220 

Gly Phe Phe Asn Ala Glu Glu lie Gin Pro Asn Gly Asn Phe Met lie 
225 230 235 240 

Asp Asp Lys Gly Leu Thr Tyr Cys Phe Asn Glu Tyr Gin lie Ala Ala 
5 245 250 255 

Tyr Ala Arg Gly Ala Val Tyr Val Arg Leu Gly Tyr Asp Val Leu Ala 

260 265 270 

Pro Leu Leu Arg Asp Asp Ser Pro Leu Lys Arg Tyr Leu Pro 
275 280 285 



10 



30 



(2) INFORMATION FOR SEQ ID NO: 74 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 412 amino acids 
15 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

20 (iii) HYPOTHETICAL: YES 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

25 (ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1 . . .412 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74 



Gly Glu Tyr Pro Ala Asn Ser Asn Asp Lys Lys Glu Met Val Met Lys 
15 10 15 

Leu lie Lys Arg Ser Leu Leu Leu Leu Gly Ala Val Leu Leu lie Thr 
20 25 30 

35 Leu Pro Ala Tyr Ser Gin Asn Asp Asp lie Phe Glu Asp Asp lie Tyr 

35 40 45 
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Thr Ser Arg Lys Glu lie Arg Lys Gin Asn Gin Val Lys Asp Trp Gin 

50 55 60 

Asn Gin Glu Asp Gly Tyr Gly Asp Asp Thr Glu Tyr Thr Val Ala Ser 
65 70 75 80 

5 Asp Arg Asp lie Asp Ala Tyr Asn Arg Arg Asp Gly Gin Ser Tyr Asp 

85 90 95 

Gly Lys Lys Leu Ser Lys Asp Lys Lys Arg Asp Ser Thr Arg Ser Ser 

100 105 110 

Val Pro Gly Arg Tyr Ser Arg Arg Leu Ala Arg Phe Tyr Lys Pro Asn 
10 115 120 125 

Thr lie Val lie Ser Gly Ala Asp Asn Val Tyr Val Thr Asp Asp Gly 

130 135 140 

Glu Tyr Phe Val Tyr Gly Asp Glu Tyr Tyr Asp Asp Ala Ser Ser Val 
145 150 155 160 

15 Asn lie Tyr lie Asn Ser Pro Trp Cys Asp Pro Phe Pro Tyr Thr Ser 

165 170 175 

Trp Tyr Pro Ser Phe Ser Gly Trp Tyr Asn Tyr Thr Trp Asn Tyr Pro 

180 185 190 

Trp Phe Tyr Tyr Gly Ser His lie Gly Trp Gly Gly Tyr Tyr Pro Gly 
20 195 200 205 

Tyr Asn Trp Tyr Trp Ser Tyr Tyr Tyr Asp Pro Phe Tyr Asn Pro Tyr 

210 215 220 

Gly lie Gly Met Gly Trp Gly Tyr Pro Tyr Gly Trp Gly Ser Tyr Tyr 
225 230 235 - 240 

25 Gly Trp Gly Gly Tyr Pro Gly Val lie His His Tyr His His Tyr Pro 

245 250 255 

Lys Lys Thr Tyr Ser Asn Gly Gin His Ser Gly Ala Tyr Tyr Ser Tyr 

260 265 270 

Gly Arg Pro Asn Arg lie Lys Gly Gly Thr Ser Gly Ala Lys Leu Gly 
30 275 280 285 

Thr Gly Arg Tyr Asp Arg lie Gin Asn Ser Ser Ser Gin Lys Asn Lys 

290 295 300 

Phe Gly Leu Gin Ser Asn Lys Pro Asn Asn Asn Leu Gin Asn Val Lys 
305 310 315 320 

35 Ser Gly Arg Thr Gly Arg Ala Asn Arg Asp Arg Asn lie Glu Thr Val 

325 330 335 
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Thr Pro Asn Asn Gly Gin Lys Gin Asn Arg Pro Val Phe Gin Gin Asn 

340 345 350 

Gin Ser Gly Asn Asp Arg Pro Thr Gly Arg Asn lie Arg Ser Glu Arg 
355 360 365 

5 Gin Gly Glu Asn Asn Asp Arg Thr Phe Ser Thr Pro Ser Arg Ser Asn 
370 375 380 

Ser Asn Gly Gly Phe Ser Thr Pro Ser Arg Ser Ser Ser Gly Ser Met 
385 390 395 400 

Ser Gly Gly Gly Gly Arg Ser Gly Arg Gly Arg Asn 
10 405 410 

(2) INFORMATION FOR SEQ ID NO: 75 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 211 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



20 



(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis - 

25 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1. . .211 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75 

Lys Arg. Lys Ser Met Lys Arg Met Leu Leu Leu Leu Val Val Leu Leu 

15 10 15 

Tyr Gly lie Ala Gly Arg Leu Ala Ala Gin Asp Val lie Arg Pro Trp 
35 20 25 30 

Ser Leu Gin Val Gly /Via Gly Tyr Ser Asp Thr Glu Asn lie Pro Gly 
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35 40 45 

Gly Phe Thr Tyr Gly Phe Tyr Leu Gly Lys Arg Met Gly Ser Phe Leu 

50 55 60 

Glu Val Gly Leu Ser Met Tyr Asn Ser Thr Arg Gin Thr Ala Asn Asn 
5 65 70 75 80 

Ala Asp Ser Phe Ala Ser Asn Glu Gly Asp Gly Ser Phe Gin Val Asn 

85 90 95 

Met Ser Ser Pro Asn Glu Lys Trp Ser Phe Phe Asp Ala Gly Ser Ala 
100 105 110 

10 Asn Cys Tyr Met lie Val Val Gly Val Asn Pro Leu His Leu Phe Trp 
115 120 125 

Gin Asn Ser Arg His Asn Leu Phe Leu Ala Val Gin Ala Gly Leu Ser 

130 135 140 

Asn Lys His Asn lie His Phe He Tyr Gly Asp Lys Gly Ala Lys Val 
15 145 150 . 155 160 

Ser He Tyr Thr Asn Ser Asn Thr Tyr He Gly Tyr Gly Ala Arg Val 

165 170 175 

Ala Tyr Glu Tyr Gin He His Lys Asn Val Gly Ala Gly Ala Ala Val 
180 185 190 

20 Met Tyr Asp His Gly Asn Lys Met Leu Thr Ala Met Ala Thr Leu Ser 
195 200 205 

Thr His Phe 
210 

25 (2) INFORMATION FOR SEQ ID NO: 76 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 86 amino acids 

(B) TYPE: amino acid 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

35 



(vi) ORIGINAL SOURCE: 

* 
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(A) ORGANISM: Porphyromonas gingivalis 



(ix) FEATURE: 

(A) NAME/ KEY: misc_f eature 

(B) LOCATION 1. . . 786 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76 



lie Val Tyr Leu 
1 

Leu Thr lie Arg 
20 

Phe Ala Ala Asp 
35 

Glu Tyr Val Asn 
50 

Pro Gly Ala Val 
65 

Val Ser Gly Ser 

Ser Ala Pro Tyr 
100 

Val Asn Leu Ser 
115 

Met Ala Thr Ser 
130 

Ser Leu Ser Arg 
145 

Asp Lys Tyr Gly 

Leu Thr Glu Phe 
180 

Leu Gly Gin Ala 
195 

Asn Asp Ser Thr 
210 



Cys His Cys Met 
5 

Asn Phe Leu Leu 

Ala Gin Ser Ser 
40 

Pro Phe lie Gly 
55 

Leu Pro Asn Gly 
70 

Thr Glu Asn Arg 
85 

Ser Ala Asp Asn 

Gly Val Gly Cys 
120 

Gly Thr Phe Asp 
135 

Glu Tyr Ala Arg 
150 

lie Asp Ala Ala 
165 

Ala Phe Pro Glu 

Leu Ser Asn Glu 
200 

Val Val Gly Ser 
215 



Asn His Arg Arg 
10 

Phe Cys Cys Leu 
25 

Val Ser Ser Gly 

Thr Ala Asn Tyr 
60 

Leu Met Ser Val 
75 

Phe Asp Lys Asp 
90 

Ser Tyr Cys lie 
105 

Pro Glu Leu Ser 

Pro Asp Tyr Cys 
140 

Pro Gly Glu Tyr 
155 

Val Thr Val Thr 
170 

Gly Glu Gly His 
185 

Ser Gly Ala Ser 

Arg Leu Met Gly 
220 



Ser Lys Thr Met 
15 

Ser Leu lie Ala 
30 

Arg Arg Leu Thr 
45 

Gly Thr Thr Asn 

Thr Pro Phe Asn 
80 

Ser Arg Trp Trp 
95 

Gly Phe Ser His 
110 

Gly lie Leu Leu 
125 - 

Cys Tyr Gly Ser 

Lys Ala Val Leu 
160 

Glu Arg Thr Ala 
175 

lie Leu Leu Asn 
190 

Val Arg Phe Leu 
205 

Thr Phe Cys Tyr 
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Asn Pro Gin Ala Val Phe Arg Gin Tyr Phe Val Leu Gin Val Ser Arg 
225 230 235 240 

Arg Pro lie Ser Ala Gly Tyr Trp Lys Lys Gin Pro Pro Met Thr Val 
245 250 255 

5 Glu Ala Gin Trp Asp Ser Thr Ala Gly Lys Tyr Lys Gin Tyr Asp Gly 
260 265 270 

Tyr Lys Arg Glu Met Ser Gly Asp Asp lie Gly Val Arg Phe Ser Phe 

275 280 285 

Asn Cys Asp Gin Gly Glu Lys lie Tyr Val Arg Ser Ala Val Ser Phe 
10 290 295 300 

Val Ser Glu Ala Asn Ala Leu Tyr Asn Leu Glu Ala Glu Gin Glu Glu 
305 310 315 320 

Val Phe Lys Ser Val Gly Gly Asn Pro Ala Lys Ala Phe Ser Ala lie 
325 330 335 

15 Arg Ser Arg Ala lie Glu Arg Trp Glu Glu Ala Leu Gly Thr Val Glu 
340 345 350 

Val Glu Gly Gly Thr Pro Asp Glu Lys Thr lie Phe Tyr Thr Ala Leu 

355 360 365 

Tyr His Leu Leu lie His Pro Asn lie Leu Gin Asp Ala Asn Gly Glu 
20 370 375 380 

Tyr Pro Met Met Gly Ser Gly Lys Thr Gly Asn Thr Ala His Asp Arg 
385 390 395 400 

Tyr Thr Val Phe Ser Leu Trp Asp Thr Tyr Arg Asn Val His Pro Leu 
405 410 - 415 

25 Leu Cys Leu Leu Tyr Pro Glu Lys Gin Leu Asp Met Val Arg Thr Leu 
420 425 430 

lie Asp Met Tyr Arg Glu Ser Gly Trp Leu Pro Arg Trp Glu Leu Tyr 

435 440 445 

Gly Gin Glu Thr Leu Thr Met Glu Gly Asp Pro Ser Leu lie Val lie 
30 450 455 460 

Asn Asp Thr Trp Gin Arg Gly Leu Arg Ala Phe Asp Thr Ala Thr Ala 
465 470 475 480 

Tyr Glu Ala Met Lys Lys Asn Ala Ser Ser Ala Gly Ala Thr His Pro 
485 490 495 

35 lie Arg Pro Asp Asn Asp Asp Tyr Leu Thr Leu Gly Phe Val Pro Leu 
500 505 510 
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Arg Glu Gin Tyr Asp Asn Ser Val Ser His Ala Leu Glu Tyr Tyr Leu 

515 520 525 

Ala Asp Trp Asn Leu Ser Arg Phe Ala His Ala Leu Gly His Lys Glu 
530 535 540 

5 Asp Ala Ala Leu Phe Gly Lys Arg Ser Leu Gly Tyr Arg His Tyr Tyr 
545 550 555 560 

Asn Lys Glu Tyr Gly Met Leu Cys Pro Leu Leu Pro Asp Gly Ser Phe 

565 570 575 

Leu Thr Pro Phe Asp Pro Lys Gin Gly Glu Asn Phe Glu Pro Asn Pro 
10 580 585 590 

Gly Phe His Glu Gly Ser Ala Tyr Asn Tyr Ala Phe Phe Val Pro His 

595 600 605 

Asp lie Gin Gly Leu Ala Arg Leu Met Gly Gly Ala Lys Val Phe Ser 
610 615 620 

15 Glu Arg Leu Gin Lys Val Phe Asp Glu Gly Tyr Tyr Asp Pro Thr Asn 
625 630 635 640 

Glu Pro Asp lie Ala Tyr Pro Tyr Leu Phe Ser Tyr Phe Pro Lys Glu 

645 650 655 

Ala Trp Arg Thr Gin Lys Leu Thr Arg Glu Leu lie Asp Lys His Phe 
20 660 665 670 

Cys Asn Ala Pro Asn Gly Leu Pro Gly Asn Asp Asp Ala Gly Thr Met 

675 680 685 

Ser Ala Trp Leu Val Tyr Ser Met Leu Gly Phe Tyr Pro Asp Cys Pro 
690 695 700 

25 Gly Ser Pro Thr Tyr Thr Leu Thr Ser Pro Val Phe Pro Arg Val Arg 
705 710 715 720 

lie Arg Leu Asn Pro Gin Tyr Tyr Pro Gin Gly Glu Leu lie lie Thr 

725 730 735 

Thr Asn Thr Glu Asn Gin Pro Thr Asp Ser lie Tyr He His Thr Val 
30 740 745 750 

Ser Leu Gly Asn Lys Thr Leu Pro His Gly Thr Arg His He Ser His 

755 760 765 

Ala Asp Leu Val Arg Cys Gly His Leu Arg Tyr Glu Leu Ser Asn Arg 
770 775 780 

35 Pro Arg 
785 
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10 



15 



(2) INFORMATION FOR SEQ ID NO: 77 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 953 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

( iii ) HYPOTHETICAL : YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1. . . 953 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77 

lie Arg Met Arg Val Ser Asp Leu Cys Ser Arg Leu Ser Trp Leu Leu 

15 10 15 

Pro Val lie Leu Val Gly Leu Leu Cys Ala Thr Leu Val Ala Ala Glu 

25 20 25 30 

Arg Pro Met Ala Gly Ala Val Gly Leu His His Arg Arg His Ala Ala 

35 40 45 

Leu Ser Asp Ser Thr Ala Lys Asp Thr Val Pro Leu Ala Lys Pro lie 
50 55 60 

30 Pro Asp Ser Ala Phe Arg Asp Ser Leu Pro Ala Asp Ser Thr Gly Ser 
65 70 75 80 

Met Arg Gin Asp Ser Val Tyr Asp Asp Glu Phe Glu Leu Glu Asp lie 

85 90 95 

Val Glu Tyr Glu Ala Ala Asp Ser lie Val Leu Leu Gly Gin Asn Arg 

35 100 105 110 

Ala Tyr Leu Phe Gly Lys Ser Tyr Val Ser Tyr Gin Lys Ser Arg Leu 
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115 120 125 

Glu Ala Asn Phe Met Tyr Leu Asn Thr Asp Ser Ser Thr Val Tyr Thr 

130 135 140 

Arg Tyr Val Leu Asp Thr Ala Gly Tyr Pro Met Ala Phe Pro Val Phe 
5 145 150 155 160 

Lys Asp Gly Glu Gin Ser Phe Glu Ala Lys Asn Phe Thr Tyr Asn Phe 

165 170 175 

Arg Thr Glu Lys Gly lie lie Ser Gly Val lie Thr Gin Gin Gly Glu 
180 185 190 

10 Gly Tyr Leu Thr Ala Gly Lys Thr Lys Lys Met Pro Asp Asn lie Met 
195 200 205 

Phe Met Gin Gly Gly Arg Tyr Thr Thr Cys Asp Asn His Asp His Pro 

210 215 220 

His Phe Tyr lie Asn Leu Ser Lys Ala Lys Val His Pro Glu Lys Asp 
15 225 230 235 240 

lie Val Thr Gly Pro Val Asn Leu Val lie Ala Asp Met Pro Leu Pro 

245 250 255 

lie Gly Leu Pro Phe Gly Tyr Phe Pro Phe Ser Asn Lys Tyr Ser Ser 
260 265 270 

20 Gly lie Leu Met Pro Thr Tyr Gly Glu Asp Asn Arg Tyr Gly Phe Tyr 
275 280 285 

Leu Arg Asn Gly Gly Tyr Tyr Phe Ala Phe Ser Asp Tyr lie Asp Leu 

29.0 295 300 

Ala Leu Arg Gly Glu lie Phe Ser Lys Gly Ser Trp Gly lie Ser Ala 
25 305 310 315 320 

Gin Ser Lys Tyr Lys Lys Arg Tyr Lys Tyr Asn Gly Ser Phe Glu Ala 

325 330 335 

Asn Tyr Leu Val Ser Lys Ser Gly Asp Lys Tyr Val Pro Gly Asp Tyr 
340 345 350 

30 Ser Lys Thr Thr Ser Leu Asn lie Arg Trp Thr His Ser Gin Asp Pro 
355 360 365 

Lys Ala Asn Pro Leu Gin Thr Leu Ser Ala Asn Val Asn Phe Ala Thr 

370 375 380 

Gly Ser Tyr Phe Gin Asn Ser Leu Asn Thr Thr Tyr Asp Val Asn Ala 
35 385 390 395 400 

Arg Thr Ala Thr Thr Arg Ser Ser Ala Val Ser Tyr Ser Arg Lys Phe 
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405 410 415 

Pro Gly Thr Pro Phe Ser lie Thr Gly Ser Met Asp lie Ser Gin Asn 

420 425 430 

Met Arg Asp Thr Thr Val Ser Leu Thr Leu Pro Asn Leu Ser lie Asn 
5 435 440 445 

Met Ser Thr Arg Tyr Pro Phe Lys Arg Lys Thr Arg Val Gly Pro Glu 

450 455 460 

Arg Trp Tyr Glu Lys Leu Ser Val Gly Tyr Ser Gly Gin Leu Arg Asn 
465 470 475 480 

10 Ser lie Leu Thr Lys Glu Lys Asp Leu Leu Gin Ser Asn Leu Val Arg 

485 490 495 

Asp Trp Lys Asn Gly Met Arg His Ser Val Pro lie Ser Leu Thr Val 

500 505 510 

Pro Leu Leu Asp Tyr lie Asn Leu Thr Met Gly Val Asn Tyr Asn Glu 
15 515 520 525 

Trp Trp Tyr Thr Lys Gly lie Arg Lys Ser Trp Asn Glu Asp Lys Lys 

530 535 540 

Thr Phe Leu Pro Ser Asp Thr Thr Tyr Lys Phe Arg Arg Leu Tyr Asp 
545 550 555 560 

20 Tyr Ser Leu Ser Ala Gly Leu Ser Thr Thr Leu Tyr Gly Met Phe Lys 

565 570 575 

Pro Trp Lys Pro Phe Ser Phe Gly Gly Asn Leu lie Met lie Arg His 

580 585 590 

Arg Phe Thr Pro Thr Val Ser Phe Ser Tyr Met Pro Asp Phe Thr Lys 
25 595 600 605 

Arg Arg Tyr Gly Phe Trp Glu Leu Leu Glu His Thr Asp Gin Asn Gly 

610 615 620 

Lys Leu His Thr Leu Leu Tyr Ser Pro Tyr Phe Glu Gin lie Phe Gly 
625 630 635 640 

30 Ala Pro Ser Met Gly Asn Ala Gly Ser Val Asn Phe Ser Phe Asp Asn 

645 650 655 

Asn Leu Glu Ala Lys lie Lys Ser Lys Ser Asp Ser Thr Gly lie Lys 

660 665 670 

Lys lie Ser Leu lie Asp Gin Phe Thr Trp Ser Thr Ser Tyr Asn Met 
35 675 680 685 

Phe Ala Asp Ser lie Arg Trp Ser Asn lie Ser Ala Ser Leu Ala Leu 
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690 695 700 

Arg Leu Ser Lys Ser Phe Thr Leu Arg Leu Ser Gly Leu Phe Asp Pro 
705 710 715 720 

Tyr Leu Thr Lys Tyr Tyr Glu Gly Glu Asp Gly Lys lie lie Pro Tyr 
5 725 730 735 

Lys Ser Asn Asp Leu Arg lie Phe Asn Gly Lys Gly Leu Ala Arg Leu 

740 745 750 

lie Ser Thr Gly Thr Ser Phe Ser Tyr Thr Leu Asn Lys Glu Ser Leu 
755 760 765 

10 Ser Gly Leu lie Ala Leu Phe Ser Gly Lys Lys Glu Arg Arg Asp Glu 
770 775 780 

Lys Lys Asn Thr Gly Ala Thr Pro His Glu Gly Asp Asp Ala Ala Asp 
785 790 795 800 

lie Leu Glu Gly Gly Arg Pro Gin Asn Glu Ser Gly Gly Ser Leu Leu 
15 805 810 815 

Glu Arg Asn Arg Gin Gly Gly Ala Val Asp Gin Asp Gly Tyr Phe Ala 

820 825 830 

Tyr Ser lie Pro Trp Ser Leu Ser Phe Asp Tyr Ser Trp Asn lie Ala 
835 840 845 

20 Thr Asp Tyr Asn Arg Tyr Asn Val Asn Lys Met Glu His Tyr Tyr Arg 
850 855 860 

Val Thr Gin Asn Leu Ser Phe Arg Gly Asn lie Gin Pro Thr Pro Asn 
865 870 875 880 

Trp Ser Phe Gly Phe Asn Ala Asn Tyr Asn Phe Asp Leu Lys Lys lie 
25 885 890 895 

Thr Ser Leu Thr Cys Asn Val Thr Arg Asp Met His Cys Trp Ala lie 

900 905 910 

Ser Ala Ser Phe lie Pro He Gly Ala Tyr Lys Ser Tyr Asn Phe Val 
915 920 925 

30 He Ser Val Lys Ser Ser Leu Leu Gin Asp Leu Lys Tyr Gin Gin Ser 
930 935 940 

Asn Arg Pro He Thr Asn Thr Trp Tyr 
945 950 

35 (2) INFORMATION FOR SEQ ID NO: 78 
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<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1 . . . 601 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:78 

Ser Asn Ser Ser Ser His Lys Trp Leu He Tyr Tyr His He Glu Lys 

15 10 15 

Thr Lys Ser He Met He Arg Lys Leu He Leu Leu Leu Ala Leu Met 

20 25 30 

Pro Val Ala Ser Val Ala Phe Ala Val Pro Thr Asp Ser Thr Glu Ser 

35 40 45 

Lys Asp Asn Arg He Leu Thr Ser Met Gin Ser Ser Ser Leu Asn Arg 

50 55 60 

Asp Asp Ala Pro Asp Lys Trp Gin Pro Met His Ala Asn Phe Ser He 
65 70 75 80 

Gin Ser Asp Met Leu Leu Ser Thr Ala Gin Lys Ser Lys Asn Thr Trp 

85 90 95 

Phe Gly Asn Ser Tyr He Met Gly He He Lys Asn Asn Tyr Leu Glu 

100 105 HO 

Phe Gly Ala Arg Phe Glu Asp Leu Tyr Lys Pro Leu Pro Gly His Glu 

115 120 125 

Pro Glu Met Unk Arg Gly Val Pro His Met Tyr Val Lys Gly Ser Tyr 
130 135 140 
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His Trp Ala Glu Leu Thr Met Gly Asp Phe Tyr Asp Gin Phe Gly Ser 
145 150 155 160 

Gly Met Val Phe Arg Thr Tyr Glu Glu Arg Asn Leu Gly lie Asp Asn 
165 170 175 

5 Ala Val Arg Gly Gly Arg lie Val Leu Thr Pro Phe Asp Gly Val Arg 
180 185 190 

Val Lys Gly He Ala Gly Gin Gin Arg Asn Tyr Phe Asp Arg Thr Gly 

195 ' 200 205 

Lys Val Phe Asn Ser Gly Arg Gly Tyr Leu Leu Gly Ser Asp Leu Glu 
10 210 215 220 

Leu Asn Val Glu Arg Trp Ser Ser Ala Met Arg Asp Asn Asp Tyr His 
225 230 235 240 

Leu Ala He Gly Gly Ser Phe Val Ser Lys His Glu Ala Asp Glu Asp 
245 250 255 

15 He Phe Val Gly Val Gly Glu Asp Arg Lys Arg Leu Asn Leu Pro Leu 
260 265 270 

Asn Val Pro He Met Gly Leu Arg Thr Asn Phe Gin Lys Gly Gly Leu 

275 280 285 

Ala Leu Tyr Ala Glu Tyr Gly Tyr Lys Tyr Asn Asp Pro Ser Ala Asp 
20 290 295 300 

Asn Asp Tyr He Tyr His Asp Gly Gin Ala Ala Leu Leu Ser Ala Ser 
305 310 315 320 

Tyr Ser Lys Lys Gly Met Ser He Leu Leu Gin Ala Lys Arg Cys Glu 
325 330 - 335 

25 Asn Phe Ala Phe Arg Ser Lys Arg Ser Ala Gin Leu Thr Pro Leu Met 
340 345 350 

He Asn Tyr Met Pro Ala Phe Thr Gin Ala His Thr Tyr Thr Leu Ala 

355 360 365 

Ala He Tyr Pro Tyr Ala Thr Gin Pro Gin Gly Glu Trp Ala Phe Gin 
30 370 375 380 

Gly Glu Leu Arg Tyr Asn Phe Ala Arg Arg Thr Ala Leu Gly Gly Arg 
385 390 395 400 

Tyr Gly Thr Gly Leu Arg He Asn Val Ser His Val Arg Gly Leu Asp 
405 410 415 

35 Lys Lys Met Leu Lys Glu Asn Pro Asp Glu Leu He Gly Thr Asp Gly 
420 425 430 
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Tyr Thr Val Ser Phe Phe Gly Met Gly Asp Leu Tyr Tyr Ser Asp lie 

435 440 445 

Asp Val Glu lie Thr Lys Lys Val Ser Pro Gly Phe Asn Phe Thr Leu 

450 455 460 

Thr Tyr Leu Asn Gin lie Tyr Asn Asn Lys Val Leu His Gly Ala Ala 
465 470 475 480 

Gly Glu Lys Pro Glu Lys lie Tyr Ala Asn lie Phe Val Tyr Asp Gly 

485 490 495 

Lys Tyr Lys Leu Ser Asn Lys Val Ala Leu Arg Thr Glu Leu Gin Tyr 

500 505 510 

Leu His Thr Lys Gin Asp Gin Gly Asp Trp lie Tyr Gly Met Ala Glu 

515 520 525 

Leu Ser lie Leu Pro Ser Leu Met Leu Ser Leu Ser Glu Gin Tyr Asn 

530 535 540 

lie Gly Glu Thr Lys Lys His Tyr Val Met Gly Ser Val Thr Tyr Thr 
545 550 555 560 

His Gly Ala His Arg Val Ala Phe Ser Ala Gly Lys Thr Arg Ala Gly 

565 570 575 

Met Asn Cys Ser Gly Gly Val Cys Arg Val Val Pro Glu Thr Gin Gly 

580 585 590 

Phe Tyr Leu Ser Tyr Ser Thr Asn Leu 
595 600 

(2) INFORMATION FOR SEQ ID NO: 79 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 62 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyroraonas gingivalis 
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(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1. . . 962 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79 



Ala lie Phe Val 
1 

10 Tyr Asn Ser Asp 
20 

Ser Gly Leu Ala 
35 

Ala Gin lie Ser 
15 50 

Ala Gly Thr His 
65 

Asp Phe Asn Pro 

20 Arg Asp Gly Arg 
100 

Asp Phe Ala Ser 
115 

Val Tyr Arg Leu 
25 130 

Tyr Tyr Asp Ala 
145 

Thr Pro Asp His 

30 Arg Arg Asn Gly 
180 

lie Met Asp Tyr 
195 

lie Ser Gly Ala 
35 210 

Thr Asp Asn His 



Val Ser Leu Gin lie Glu 
5 10 
Lys Ser Met Asn Lys Phe 

25 

Ala Phe Val Ser Met Ala 
40 

Phe Gly Gly Glu Pro Leu 
55 

Ser Phe Asp Asp Ala Met 
70 

Glu Asp Leu lie Ala Gin 
85 90 
Pro Val Arg lie Gly Gin 
105 

Lys Ala Ser His lie Ser 
120 

Gin Phe Lys Leu Glu Gly 
135 

Phe Asn lie Pro Glu Gly 
150 

Glu lie Val Leu Gly Ala 
165 170 
Ala Phe Ala Thr Glu Pro 
185 

Glu Val Ser Arg Gly Gly 
200 

Gly Tyr lie Phe Asp Lys 
215 

Tyr Gly lie Gly Glu Asp 



Lys lie Thr Glu Asn Lys 
15 

Tyr Lys Ser Leu Leu Gin 
30 

Thr Ala Leu Thr Ala Ser 
45 

Ser Phe Ser Ser Arg Ser 
60 

Thr lie Arg Leu Thr Pro 
75 80 
Ser Arg Trp Gin .Ser Gin 
95 

Val lie Pro Val Asp Val 
110 

Ser lie Gly Asp Val Asp 
125 

Ala Lys Ala lie Thr Leu 
140 

Gly Arg Leu Tyr lie Tyr 
155 160 
Tyr Thr Asn Ala Thr His 
175 

Val Pro Gly Ser Glu Leu 
190 

Thr Leu Pro Asp lie Lys 
205 

Val Gly Gly Arg Pro Val 
220 

Asp Ser Asp Ser Asp Cys 
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225 230 235 240 

Glu lie Asn He Asn Cys Pro Glu Gly Ala Asp Trp Gin Ala Glu Lys 

245 250 255 

Asn Gly Val Val Gin Met He Met Val Lys Gly Gin Tyr He Ser Met 
5 260 265 270 

Cys Ser Gly Asn Leu Leu Asn Asn Thr Lys Gly Asp Phe Thr Pro Leu 

275 280 285 

He He Ser Ala Gly His Cys Ala Ser He Thr Thr Asn Phe Gly Val 
290 295 300 

10 Thr Gin Ser Glu Leu Asp Lys Trp He Phe Thr Phe His Tyr Glu Lys 
305 310 315 320 

Arg Gly Cys Ser Asn Gly Thr Leu Ala He Phe Arg Gly Asn Ser He 

325 330 335 

He Gly Ala Ser Met Lys Ala Phe Leu Pro He Lys Gly Lys Ser Asp 
15 340 345 350 

Gly Leu Leu Leu Gin Leu Asn Asp Glu Val Pro Leu Arg Tyr Arg Val 

355 360 365 

Tyr Tyr Asn Gly Trp Asp Ser Thr Pro Asp He Pro Ser Ser Gly Ala 
370 375 380 

20 Gly He His His Pro Ala Gly Asp Ala Met Lys He Ser He Leu Lys 
385 390 395 400 

Lys Thr Pro Ala Leu Asn Thr Trp He Ser Ser Ser Gly Ser Gly Gly 

405 410 415 

Thr Asp Asp His Phe Tyr Phe Lys Tyr Asp Gin Gly Gly Thr Glu Gly 
25 420 425 430 

Gly Ser Ser Gly Ser Ser Leu Phe Asn Gin Asn Lys His Val Val Gly 

435 440 445 

Thr Leu Thr Gly Gly Ala Gly Asn Cys Gly Gly Thr Glu Phe Tyr Gly 
450 455 460 

30 Arg Leu Asn Ser His Trp Asn Glu Tyr Ala Ser Asp Gly Asn Thr Ser 
465 470 475 480 

Arg Met Asp He Tyr Leu Asp Pro Gin Asn Asn Gly Gin Thr Thr He 

485 490 495 

Leu Asn Gly Thr Tyr Arg Asp Gly Tyr Lys Pro Leu Pro Ser Val Pro 
35 500 505 510 

Arg Leu Leu Leu Gin Ser Thr Gly Asp Gin Val Glu Leu Asn Trp Thr 
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515 520 525 

Ala Val Pro Ala Asp Gin Tyr Pro Ser Ser Tyr Gin Val Glu Tyr His 

530 535 540 

lie Phe Arg Asn Gly Lys Glu lie Ala Thr Thr Lys Glu Leu Ser Tyr 
5 545 550 555 560 

Ser Asp Ala He Asp Glu Ser He He Gly Ser Gly He He Arg Tyr 

565 570 575 

Glu Val Ser Ala Arg Phe He Tyr Pro Ser Pro Leu Asp Gly Val Glu 
580 585 590 

10 Ser Tyr Lys Asp Thr Asp Lys Thr Ser Ala Asp Leu Ala He Gly Asp 
595 600 605 

He Gin Thr Lys Leu Lys Pro Asp Val Thr Pro Leu Pro Gly Gly Gly 

610 615 620 

Val Ser Leu Ser Trp Lys Val Pro Phe Leu Ser Gin Leu Val Ser Arg 
15 625 630 635 640 

Phe Gly Glu Ser Pro Asn Pro Val Phe Lys Thr Phe Glu Val Pro Tyr 

645 650 655 

Val Ser Ala Ala Ala Ala Gin Thr Pro Asn Pro Pro Val Gly Val Val 
660 665 670 

20 He Ala Asp Lys Phe Met Ala Gly Thr Tyr Pro Glu Lys Ala Ala He 
675 680 685 

Ala Ala Val Tyr Val Met Pro Ser Ala Pro Asp Ser Thr Phe His Leu 

690 695 700 

Phe Leu Lys Ser Asn Thr Asn Arg Arg Leu Gin Lys Val Thr Thr Pro 
25 705 710 715 720 

Ser Asp Trp Gin Ala Gly Thr Trp Leu Arg He Asn Leu Asp Lys Pro 

725 730 735 

Phe Pro Val Asn Asn Asp His Met Leu Phe Ala Gly He Arg Met Pro 
740 745 750 

30 Asn Lys Tyr Lys Leu Asn Arg Ala He Arg Tyr Val Arg Asn Pro Asp 
755 760 765 

Asn Leu Phe Ser He Thr Gly Lys Lys He Ser Tyr Asn Asn Gly Val 

770 775 780 

Ser Phe Glu Gly Tyr Gly He Pro Ser Leu Leu Gly Tyr Met Ala He 
35 785 790 795 800 

Lys Tyr Leu Val Val Asn Thr Asp Ala Pro Lys He Asp Met Ser Leu 
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805 810 815 

Val Gin Glu Pro Tyr Ala Lys Gly Thr Asn Val Ala Pro Phe Pro Glu 

820 825 830 

Leu Val Gly lie Tyr Val Tyr Lys Asn Gly Thr Phe lie Gly Thr Gin 
5 835 840 845 

Asp Pro Ser Val Thr Thr Tyr Ser Val Ser Asp Gly Thr Glu Ser Asp 

850 855 860 

Glu Tyr Glu lie Lys Leu Val Tyr Lys Gly Ser Gly lie Ser Asn Gly 
865 870 875 880 

10 Val Ala Gin He Glu Asn Asn Asn Ala Val Val Ala Tyr Pro Ser Val 

885 890 895 

Val Thr Asp Arg Phe Ser He Lys Asn Ala His Met Val His Ala Ala 

900 905 910 

Ala Leu Tyr Ser Leu Asp Gly Lys Gin Val Arg Ser Trp Asn Asn Leu 
15 915 920 925 

Arg Asn Gly Val Thr Phe Ser Val Gin Gly Leu Thr Ala Gly Thr Tyr 

930 935 940 

Met Leu Val Met Gin Thr Ala Asn Gly Pro Val Ser Gin Lys He Val 
945 950 955 960 

20 Lys Gin 



(2) INFORMATION FOR SEQ ID NO: 80 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1312 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Porphyromonas gingivalis 
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(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1. . . 1312 

5 <xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 0 

Lys Arg Val Asp Ser Tyr Glu Cys Arg Gin Lys Ala Cys Lys Cys Ala 
15 10 15 

lie Cys Val lie Gin Lys Phe Thr Asn Val Lys Leu Asn Asp Met Arg 
10 20 25 30 

Lys lie Leu Ser Phe Leu Met Met Cys Ser Leu His Leu Gly Leu Gin 

35 40 45 

Ser Gin Thr Trp His Gly Asp Pro Asp Ser Val Ala Ala Leu Pro Ser 
50 55 60 

15 lie Gly lie Gin Glu Ser Ser Cys Thr Arg lie Thr Phe Glu Val Val 
65 70 75 80 

Phe Pro Gly Phe Tyr Ser Val Glu Lys Arg Glu Gly Asn Gin Val Phe 

85 90 95 

Gin Arg lie Ser Met Pro Gly Cys Gly Ser Phe Gly Asn Leu Gly Glu 
20 100 105 110 

Ala Glu Leu Pro Val Leu Lys Lys Met lie Ala Val Pro Glu Phe Ser 

115 120 125 

Thr Ala Asn Val Ala Val Lys lie Lys Glu Thr Glu Thr Phe Asp Asn 
130 135 140 

25 Tyr Asn lie Tyr Pro Asn Pro Thr Tyr Val Val Glu Glu Leu Pro Glu 
145 150 155 160 

Gly Gly Thr Tyr Leu Val Glu Ala Phe Ala lie Asn Asn Asp Tyr Tyr 

165 170 175 

Ser Gin Asn Val Ser Leu Pro Ser Thr His Tyr Val Tyr Ser Gin Asp 
30 180 185 190 

Gly Tyr Phe Arg Ser Gin Arg Phe lie Glu Val Thr Leu Tyr Pro Phe 

195 200 205 

Arg Tyr Asn Pro Val Arg Gin Glu lie Leu Phe Ala Lys Lys lie Glu 
210 215 220 

35 Val Thr lie Thr Phe Asp Asn Pro Gin Pro Pro Leu Gin Lys Asn Thr 
225 230 235 240 
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Gly lie Phe Asn Lys Val Ala Ser Ser Ala Phe lie Asn Tyr Glu Ala 

245 250 255 

Asp Gly Lys Ser Ala lie Glu Asn Asp Met Val Phe Ser Arg Gly Thr 
260 265 270 

5 Thr Thr Tyr lie Ser Gly Asn Val Ala Ser Asn Leu Pro Gin Asn Cys 
275 280 285 

Asp Tyr Leu Val lie Tyr Asp Asp Met Phe Asn Val Asn Gin Gin Pro 

290 295 300 

His Asp Glu lie Lys Arg Leu Cys Glu His Arg Ala Phe Tyr Asn Gly 
10 305 310 315 320 

Phe Asp Val Ala Ala Val Ser lie Lys Asp Val Leu Asn Ser Phe Pro 

325 330 335 

Ser Asn Ala Thr Ser Tyr lie Asn Glu Thr Lys Leu Lys Asn Phe lie 
340 345 350 

15 Arg Ser Val Tyr Asn Gin Ser Asn Ala Lys Arg Thr Leu Asp Gly Lys 
355 360 365 

Leu Gly Tyr Val Leu Leu lie Gly Lys Pro Leu Ser Lys Tyr Leu Ala 

370 375 380 

Asp Thr Asp Asn Thr Lys Val Pro Thr Ser Phe lie His Asn Val Ser 
20 385 390 395 400 

Leu lie Pro Ser His Pro Thr Phe Gly Ser lie Cys Ala Ser Asp Tyr 

405 410 415 

Phe Phe Ser Cys Val Ser Pro Leu Asp Thr Val Gly Asp Leu Phe lie 
420 425 - 4-30 

25 Gly Arg Phe Ser Val Thr Asn Ala His Glu Leu His Asn Leu lie Glu 
435 440 445 

Lys Thr lie Asn Lys Glu lie Ser Tyr Asn Pro lie Ala His Lys Asn 

450 455 460 

lie Leu Tyr Ala Glu Gly Lys Gly Cys Asp Ala Pro lie Leu Arg Leu 
30 465 470 475 480 

Phe Leu Lys Glu lie Ala Ser Gly Tyr Thr Val Asn Ser lie Leu Lys 

485 490 495 

Ser Asn Gin Val Ser Ala lie Asp Ser lie Phe Asp Cys Leu Asn Asn 
500 505 510 

35 Gly Ser His His Phe Tyr Phe Asn Thr His Gly Met Pro Thr Val Trp 
515 520 525 
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Gly lie Gly Gin Gly Leu Asp Val Asn Thr Leu Thr Ala Arg Leu Asn 

530 535 540 

Asn Thr Ser Ser Gin Gly Leu Cys Thr Ser Leu Ser Cys Ser Ser Ala 
545 550 555 560 

5 Val Ala Asp Ser Thr lie Arg Ser Leu Gly Glu Val Leu Thr Thr Tyr 

565 570 575 

Ala Pro Asn Lys Gly Phe Ser Ala Phe Leu Gly Gly Ser Arg Ala Thr 

580 585 590 

Gin Tyr Ala Val Tyr Leu Glu Gly Pro Cys Pro Pro Ser Glu Phe Tyr 
10 595 600 605 

Glu Tyr Leu Pro Tyr Ser Leu Tyr His Asn Leu Ser Thr Val Val Gly 

610 615 620 

Glu Met Leu Leu Ser Ser lie lie Asn Thr Asn Ser Val Asp Thr Tyr 
625 630 635 640 

15 Ser Lys Phe Asn Phe Asn Leu Leu Gly Asp Pro Ala Leu Asn lie Met 

645 650 655 

Ala His Gly Met Glu Val Ser Asn Cys lie Thr Leu Pro Asn Asn Thr 

660 665 670 

lie lie Ser Ser Pro lie Thr lie Lys Asn Gly Gly Cys Leu Lys lie 
20 675 680 685 

Pro Glu Lys Gly Val Leu His Phe Thr Asn Asn Gly Ser lie Gin Val 

690 695 700 

Met Ser Gly Gly Thr Leu Glu lie Gly Asn Gin Ala Lys lie Ser Gly 
705 710 715 - 720 

25 Glu Thr Gly Ala Asn Pro Thr Phe lie Thr Val Tyr Gly Asp Gly Leu 

725 730 735 

Ala lie Asn Lys Gin Val Glu lie Asp Asn lie Asp Arg Leu Asn Leu 

740 745 750 

Phe Ser Thr His Ser Val Met Pro Lys Phe His Phe Asp Ser Val Lys 
30 755 760 765 

Phe Asn Ser Ala Pro Leu Tyr Thr Thr Asn Cys lie Val Glu lie Ser 

770 775 780 

Asn Cys Glu Phe Thr Asn Arg Ser Asp lie lie Ser Lys Asn Cys Asp 
785 790 795 800 

35 Leu Ser Val Glu Asn Ser Met Phe Ser Ser Ser Gly He Thr Val Phe 

805 810 815 
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Lys Pro Met Ala Thr Ser Ser lie Thr Gly Leu Ser Thr Lys Ala Lys 

820 825 830 

lie Thr Asp Asn Thr Phe Phe Ala Thr Gly Asn Phe Ala Tyr His lie 

835 840 845 

5 Thr Asn Thr Pro Gly Leu Thr Ala Thr Ser Asn Ala Ala lie Lys Leu 

850 855 860 

Asp Asn lie Pro Glu Tyr Tyr lie Ser Gly Asn Lys lie Val Asn Cys 

865 870 875 880 

Asp Glu Ala Leu Val Leu Asn Asn Ser Gly Asn Arg Thr Asn Arg Leu 

10 885 890 895 

His Asn lie Thr Arg Asn Val lie Lys Asn Cys Arg lie Gly Ser Thr 

900 905 910 

Leu Tyr Asn Ser Tyr Gly lie Tyr Asn Arg Asn Lys lie Ser Asn Asn 

915 920 925 

15 His lie Gly Val Arg Leu Leu Asn Asn Ser Cys Phe Tyr Phe Asp Asn 

930 935 940 

Ala Pro Val lie Asn Glu Glu Asp Lys Gin Thr Phe lie Ser Asn Arg 

945 950 955 960 

Thr Trp Gin Leu Tyr Ser Ser Asn Gly Thr Phe Pro Leu Asn Phe His 

20 965 970 975 

Tyr Asn Ser Leu Gin Gly Gly Asp Thr Asp Thr Trp lie Tyr Asn Asp 

980 985 990 

Thr Tyr Thr Asn Arg Tyr lie Asp Val Ser Asn Asn His Trp Gly Asn 

995 1000 1005- 

25 Asn Asp Leu Phe Asp Pro Asn Gin Val Phe Asn Thr Pro Asp Leu Phe 

1010 1015 1020 

lie Trp lie Pro Phe Trp Asp Gly Leu Pro Asn Gly Arg Ser Gly Asn 

1025 1030 1035 1040 

Ser Ser Ala Glu Ala Val Glu Phe Gin Thr Ala Leu Asp Cys lie Gly 

30 1045 1050 1055 

Asn Ser Asp Tyr Leu Ser Ala Lys Val Ala Leu Lys Met Met Val Glu 

1060 1065 1070 

Thr Tyr Pro Glu Ser Asp Phe Ala lie Ala Ala Leu Lys Glu Leu Phe 

1075 1080 1085 

35 Arg lie Glu Lys Met Ser Gly Asn Asp Tyr Glu Gly Leu Lys Asp Tyr 

1090 1095 1100 
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Phe Arg Ser Asn Pro Thr He He Ser Ser Gin Asn Leu Phe Pro Thr 
1105 1110 1115 1120 

Ala Asp Phe Leu Ser Ala Arg Cys Asp He Val Cys Glu Asn Tyr Gin 
1125 1130 1135 

5 Ser Ala He Asp Trp Tyr Glu Asn Arg Leu Asn Ser Glu He Ser Tyr 
1140 1145 1150 

Gin Asp Ser Val Phe Ala Val He Asp Leu Gly Asp He Tyr Trp Asn 

1155 1160 1165 

Met Gin Leu Asp Ser Leu Arg Gly Thr Gly He Asp Leu Asn He Leu 
10 1170 1175 1180 

Ser Cys Glu Gin Arg Lys Ser Leu Glu Ser His Gin Asn Val Lys Asn 
1185 1190 1195 1200 

Tyr Leu Leu Ser Thr Leu Pro Glu Ser Thr Gly Thr Leu Leu Pro Pro 
1205 1210 1215 

15 Leu Glu Cys Asn Lys Ser Ser Leu Asp Lys Ser Lys He He Ser He 
1220 1225 1230 

Ser Pro Asn Pro Ala Lys Ala Val Val Thr He He Tyr Tyr Thr Asp 

1235 1240 1245 

Asn Pro Ser Cys Ser Val He Lys He Tyr Gly He Asn Gly Ala Ser 
20 1250 1255 1260 

Ala Asp He Thr Gly Leu Pro Lys His Leu Ser Glu Gly Tyr Tyr Ser 
1265 1270 1275 1280 

He Gin Phe Asn Thr Ser Asn Phe Asp Pro Gly Phe Tyr Leu Val Thr 
1285 1290 " 1295 

25 Leu Asn Val Asp Gin Lys He He Asp Thr Glu Lys Leu Arg He Lys 
1300 1305 1310 

(2) INFORMATION FOR SEQ ID NO: 81 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 938 amino acids 

(B) TYPE: amino acid 
( D) TOPOLOGY : linear 

35 (ii) MOLECULE TYPE: protein 
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(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

5 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1. . . 938 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81 

Ser Glu Asn Tyr Arg Tyr Gly Lys lie Thr Asn Gin lie Met Ala lie 
15 10 15 

Met Met Lys Ser lie Val Phe Arg Ala Phe Leu Thr lie Leu Leu Ser 
15 20 25 30 

Trp Ala Ala lie Thr Asn Pro Thr Ala Gin Glu lie Ser Gly Met Asn 

35 40 45 

Ala Ser Cys Leu Ala Ala Pro Ala Gin Pro Asp Thr lie Leu Tyr Glu 
50 55 60 

20 Ser Phe Glu Asn Gly Pro Val Pro Asn Gly Trp Leu Glu lie Asp Ala 
65 70 75 80 

Asp Ala Asp Gly Ala Thr Trp Gly Ser Pro Ser Gly Ser Phe Ser Val 

85 90 95 

Pro Tyr Gly His Asn Gly Leu Cys Thr Tyr Ser His lie Arg Ser Gly 
25 100 105 110 

lie Ser Thr Ala Gly Asn Tyr Leu lie Thr Pro Asn lie Glu Gly Ala 

115 120 125 

Lys Arg Val Lys Tyr Trp Val Cys Asn Gin Tyr Ser Thr Asn Pro Glu 
130 135 140 

30 His Tyr Ala Val Met Val Ser Thr Thr Gly Thr Ala lie Glu Asp Phe 
145 150 155 160 

Val Leu Leu Phe Asp Asp Ser lie Thr Gly Lys Pro Thr Pro Leu Val 

165 170 175 

Trp Arg Arg Arg lie Val Asp Leu Pro Glu Gly Thr Lys Tyr lie Ala 
35 180 185 190 

Trp Arg His Tyr Lys Val Thr Asp Ser His Thr Glu Phe Leu Lys Leu 



Asp 

Asp 
5 225 
Asn 

Gin 

10 He 

Arg 

Glu 
15 305 

Asp 

Trp 

20 Leu 

Leu 

Tyr 
25 385 
Val 

Ala 

30 Glu 

Asn 

Thr 
35 465 
Pro 



195 
Asp Val 
210 

Phe Thr 

Tyr Pro 

Leu Ser 

Lys Asp 
275 
Asp Asn 
290 

Ser lie 

Ala He 

Leu Val 

Asn Ala 
355 
Ser Ala 
370 

Leu He 

Ser Thr 

Ser Thr 

Thr Met 
435 
Leu Pro 
450 

Asp He 
Ala Ser 



Thr Val 

Val He 

Glu Asp 
245 
Gly Tyr 
260 

Val Ser 

Pro Leu 

Glu Ser 

Leu Tyr 
325 
He Asp 
340 

Tyr Asp 

Ser Tyr 

Thr Pro 

Gin Asp 
405 
Thr Gly 
420 

Thr Ala 
Glu Gly 
Tyr Phe 
Glu Pro 



Tyr Arg 
215 
Asn He 
230 

Tyr Gin 

Asn He 

He Leu 

Gin Val 
295 
Ser Thr 
310 

Glu Asn 

Ala Asp 

Ala Phe 

Val Pro 
375 
Lys Val 
390 

Ala Asn 

Thr Ala 

Lys Pro 

Thr Lys 
455 
Leu Lys 
470 

Glu Pro 



200 

Ser He 

Gly Gin 

Pro Glu 

Tyr Ala 
265 
Glu Tyr 
280 

Glu Tyr 
Val Cys 

Phe Glu 

Gly Asp 
345 
Pro Gly 
360 

Gly He 

Glu Gly 

Trp Ala 

Val Gly 
425 
Thr Gly 
440 

Tyr He 
Leu Asp 
Val Thr 
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Glu Gly 

Asn Val 
235 
Gly Lys 
250 

Asn Gly 
Val Asp 
Cys Val 

Gly Thr 

315 
Asn Gly 
330 

Gly Phe 

His Asn 

Gly Pro 

Ala Lys 
395 
Ala Glu 
410 

Asp Phe 

Ala Trp 

Ala Trp 

Asp He 
475 
Asp Phe 



205 
Pro Glu 
220 

Gly Arg 

Gly Asn 

Thr Leu 

Ser Thr 
285 
Thr Ala 
300 

Leu His 

Pro Val 

Ser Trp 

Gly Gly 
365 
Val Thr 
380 

Arg Val 

His Tyr 

Val He 

Tyr Glu 
445 
Arg His 
460 

Thr Val 
Val Val 



Pro Ala Thr 

Leu Thr Trp 
240 

Glu Glu Leu 

255 
Leu Ala Gin 
270 

Tyr Ser Leu 
Val Tyr Asp 

Tyr Ala Thr 

320 

Pro Asn Gly 

335 
Gly His Tyr 
350 

His Cys Ser 

Pro Asp Asn 

Lys Tyr Trp 
400 

Ala Val Met 
415 

Leu Phe Glu 
430 

Arg Thr He 

Tyr Asn Cys 

Phe Gly Thr 
480 

Ser Leu He 
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485 490 495 

Glu Asn Asn Lys Gly Arg Leu Lys Trp Asn Tyr Pro Asn Gly Tyr Glu 

500 505 510 

Pro Asp Lys Thr Asp Asp Lys Asp Pro Leu Gin Leu Ala Gly Tyr Asn 
5 515 520 525 

lie Tyr Ala Asn Gly Ser Leu Leu Val His lie Gin Asp Pro Thr Val 

530 535 540 

Leu Glu Tyr lie Asp Glu Thr Tyr Ser Ser Arg Asp Asp Gin Val Glu 
545 550 555 560 

10 Val Glu Tyr Cys Val Thr Ala Val Tyr Asn Asp Asn lie Glu Ser Gin 

565 570 575 

Ser Val Cys Asp Lys Leu lie Tyr Asp Ser Gin Ser Asp lie lie Leu 

580 585 590 

Tyr Glu Gly Phe Glu Ala Gly Ser lie Pro Glu Gly Trp Leu Leu lie 
15 595 600 605 

Asp Ala Asp Gly Asp Asn Val Asn Trp Asp Tyr Tyr Pro Trp Thr Met 

610 615 620 

Tyr Gly His Asp Ser Glu Lys Cys lie Ala Ser Pro Ser Tyr Leu Pro 
625 630 635 640 

20 Met lie Gly Val Leu Thr Pro Asp Asn Tyr Leu Val Thr Pro Arg Leu 

645 650 655 

Glu Gly Ala Lys Leu Val Lys Tyr Trp Val Ser Ala Gin Asp Ala Val 

660 665 670 

Tyr Ser Ala Glu His Tyr Ala Val Met Val Ser Thr Thr Gly Thr Ala 
25 675 -680 685 

Val Glu Asp Phe Val Leu Leu Phe Glu Glu Thr Met Thr Ala Lys Ala 

690 695 700 

Asn Gly Ala Trp Tyr Glu Arg Thr lie Thr Leu Pro Ala Gly Thr Lys 
705 710 715 720 

30 Tyr lie Ala Trp Arg His Tyr Asp Cys Thr Asp Met Phe Phe Leu Leu 

725 730 735 

Leu Asp Asp lie Thr Val Tyr Arg Ser Thr Glu Thr Val Pro Glu Pro 

740 745 750 

Val Thr Asp Phe Val Val Ser Leu lie Glu Asn Asn Lys Gly Arg Leu 
35 755 760 765 

Lys Trp Asn Tyr Pro Asn Gly Tyr Glu Pro Asp Lys Thr Asp Asp Lys 



175 



770 775 780 

Lys Pro Leu Gin Leu Thr Gly Tyr Asn He Tyr Ala Asn Gly Ser Leu 
785 790 795 800 

Leu Val His He Gin Asp Pro Thr Val Leu Glu Tyr He Asp Glu Thr 
5 805 810 815 

Tyr Ser Ser Arg Asp Gly Gin Val Glu Met Glu Tyr Cys Val Thr Ala 

820 825 830 

Val Tyr Asn Asp Asn He Glu Ser Gin Ser Val Cys Asp Lys Leu Asn 
835 840 845 

10 Tyr Thr He Thr Ser Leu Asp Asn He Gin Ser Asp Thr Ser Leu Lys 
850 855 860 

He Tyr Pro Asn Pro Ala Ser Tyr Val Val Arg He Glu Gly Leu Ser 
865 870 875 880 

Arg Ser Lys Ser Thr He Glu Leu Tyr Asn Ala Leu Gly He Cys He 
15 885 890 895 

Leu Arg Glu Glu Thr His Ser Glu Lys Thr Glu He Asp Val Ser Arg 

900 905 910 

Leu Asn Asp Gly Val Tyr Leu He Lys Val Val Gly Gly Asn Lys Thr 
915 920 925 

20 Thr Thr Glu Lys Val Glu He Lys Arg Pro 
930 935 

(2) INFORMATION FOR SEQ ID NO: 82 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1251 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Porphyromonas gingivalis 
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(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...1251 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82 

Lys Trp Lys Leu Ala Cys Ala Phe Asp Cys Ala Cys Cys Phe Asp Pro 
15 10 15 

Phe Val Val Thr Asn Glu Val lie lie Met Met Lys Arg Tyr Thr lie 
10 20 25 30 

lie Leu Ala Val Phe Leu Leu Phe Cys Thr Val Phe Thr Phe Gin lie 

35 40 45 

Lys Ala Arg Pro Tyr Glu Arg Phe Ala Asp Val Glu Lys Pro Trp lie 
50 55 60 

15 Gin Lys His Ser Met Asp Ser Lys Leu Val Pro Ala Asn Lys Gly Asn 
65 70 75 80 

Leu lie Gin Ala Glu lie Val Tyr Gin Ser Val Ser Glu His Ser Asp 

85 90 95 

Leu Val lie Ser Pro Val Asn Glu lie Arg Pro Ala Asn Arg Phe Pro 
20 100 105 110 

Ser His Arg Lys Ser Phe Phe Ala Glu Asn Leu Arg Ala Ser Pro Pro 

115 120 125 

Val Val Pro Val Ala Val Asp Lys Tyr Ala Val Pro Val Ala Asn Pro 
130 135 140 

25 Met Asp Pro Glu Asn Pro Asn Ala Trp Asp Val Thr Leu Lys lie Thr 
145 150 155 160 

Thr Lys Ala Val Thr Val Pro Val Asp Val Val Met Val lie Asp Gin 

165 170 175 

Ser Ser Ser Met Gly Gly Gin Asn He Ala Arg Leu Lys Ser Ala He 
30 180 185 190 

Ala Ser Gly Gin Arg Phe Val Lys Lys Met Leu Pro Lys Gly Thr Ala 

195 200 205 

Thr Glu Gly Val Arg He Ala Leu Val Ser Tyr Asp His Glu Pro His 
210 215 220 

35 Arg Leu Ser Asp Phe Thr Lys Asp Thr Ala Phe Leu Cys Gin Lys He 
225 230 235 240 
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Arg Ala Leu Thr Pro lie Trp Gly Thr His Thr Gin Gly Gly Leu Lys 

245 250 255 

Met Ala Arg Asn lie Met Ala Thr Ser Thr Ala Val Asp Lys His lie 
260 265 270 

5 lie Leu Met Ser Asp Gly Leu Ala Thr Glu Gin Tyr Pro Val Lys Asn 
275 280 285 

Val Thr Thr Ala Asp Phe lie Gly Lys Thr Gly Asn Ala Asn Asp Pro 

290 295 300 

lie Asp Leu Val lie Gin Gly Ala lie Asn Phe Pro Thr Asn Tyr Val 
10 305 310 315 320 

Ser Asn Asn Pro Ser Thr Pro Leu Thr Pro Asn Tyr Pro Thr His Ser 

325 330 335 

Ser Lys Val Gly Arg Arg Asn Leu Pro Glu Ser Lys Phe Asp Tyr Ser 
340 345 350 

15 Asn Leu Ser Ala Arg lie Thr Phe Asp Gly Val Ala Gly Ala Leu Val 
355 360 365 

Tyr Glu Pro Arg Phe Pro His Pro Tyr Tyr Tyr Tyr Phe Pro Cys Asn 

370 375 380 

Ala Ala lie Asn Glu Ala Gin Phe Ala Lys Asn Ser Gly Tyr Thr lie 
20 385 390 395 400 

His Thr lie Gly Tyr Asp Leu Gly Asp Phe Ala Leu Ala Asn Asn Ser 

405 410 415 

Leu Lys Leu Thr Ala Thr Asp Glu Asn His Phe Phe Thr Ala Thr Pro 
420 425 430 

25 Ala Asn Leu Ala Ala Ala Phe Asp Asn lie Ala Gin Thr lie Asn lie 
435 440 445 

Gly lie Gin Arg Gly Glu Val Thr Asp Phe Val Ala Pro Gly Phe lie 

450 455 460 

Val Lys Asn Leu Thr Gin Ser Gly Asp Val Thr His Leu Leu Asn Val 
30 465 470 475 480 

Ser Asn Gly Thr Val His Tyr Asp Val Ser Thr Lys Lys Leu Thr Trp 

485 490 495 

Thr Thr Gly Thr lie Leu Ser Ser Ser Glu Ala Thr lie Thr Tyr Arg 
500 505 510 

35 lie Tyr Ala Asp Leu Asp Tyr lie Gin Asn Asn Asp lie Pro Val Asn 
515 520 525 
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Thr Thr Ser Ala lie Gly Pro Asp Leu Gly Gly Phe Asp Thr Asn Thr 

530 535 540 

Glu Ala Lys Leu Thr Tyr Thr Asn Ser Asn Gly Glu Pro Asn Gin Gin 
545 550 555 560 

5 Leu lie Phe Pro Arg Pro Thr Val Lys Leu Gly Tyr Gly Val lie Lys 

565 570 575 

Arg His Tyr Val Leu Val Asn Lys Asp Gly Gin Pro lie Gin Ala Asn 

580 585 590 

Gly Thr Val Val Ser Ser Leu Ser Glu Ala His Val Leu Gin Ser Gin 
10 595 600 605 

Asp Phe Phe Leu Pro Ser Gly Gly Gly His lie Val Pro Lys Trp lie 

610 615 620 

Lys Leu Asp Lys Thr Thr Glu Ala Leu Gin Tyr Tyr Ser Val Pro Pro 
625 630 635 640 

15 Thr Asn Thr Val lie Thr Thr Ala Asp Gly Lys Arg Tyr Arg Phe Val 

645 650 655 

Glu Val Pro Gly Ser Thr Pro Asn Pro Gly Gin lie Gly lie Ser Trp 

660 665 670 

Lys Lys Pro Ala Gly Asn Ala Tyr Phe Ala Tyr Lys Leu Leu Asn Tyr 
20 675 680 685 

Trp Met Gly Gly Thr Thr Asp Gin Gin Ser Glu Trp Asp Val Thr Ser 

690 695 700 

Asn Trp Thr Gly Ala Gin Val Pro Leu Thr Gly Glu Asp Val Glu Phe 
705 710 715 - - 720 

25 Ala Thr Thr Glu Asn Phe Gly Ser Pro Ala Val Ala Asp Leu His Val 

725 730 735 

Pro Thr Thr Asn Pro Lys lie lie Gly Asn Leu lie Asn Asn Ser Asp 

740 745 750 

Lys Asp Leu Val Val Thr Thr Ser Ser Gin Leu Thr lie Asn Gly Val 
30 755 760 765 

Val Glu Asp Asn Asn Pro Asn Val Gly Thr lie Val Val Lys Ser Ser 

770 775 780 

Lys Asp Asn Pro Thr Gly Thr Leu Leu Phe Ala Asn Pro Gly Tyr Asn 
785 790 795 800 

35 Gin Asn Val Gly Gly Thr Val Glu Phe Tyr Asn Gin Gly Tyr Asp Cys 

805 810 815 
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Ala Asp Cys Gly Met Tyr Arg Arg Ser Trp Gin Tyr Phe Gly lie Pro 

820 825 830 

Val Asn Glu Ser Gly Phe Pro lie Asn Asp Val Gly Gly Asn Glu Thr 
835 840 845 

5 Val Asn Gin Trp Val Glu Pro Phe Asn Gly Asp Lys Trp Arg Pro Ala 
850 855 860 

Pro Tyr Ala Pro Asp Thr Glu Leu Gin Lys Phe Lys Gly Tyr Gin lie 
865 870 875 880 

Thr Asn Asp Val Gin Ala Gin Pro Thr Gly Val Tyr Ser Phe Lys Gly 
10 885 890 895 

Met lie Cys Val Cys Asp Ala Phe Leu Asn Leu Thr Arg Thr Ser Gly 

900 905 910 

Val Asn Tyr Ser Gly Ala Asn Leu lie Gly Asn Ser Tyr Thr Gly Ala 
915 920 925 

15 lie Asp lie Lys Gin Gly lie Val Phe Pro Pro Glu Val Glu Gin Thr 
930 935 940 

Val Tyr Leu Phe Asn Thr Gly Thr Arg Asp Gin Trp Arg Lys Leu Asn 
945 950 955 960 

Gly Ser Thr Val Ser Gly Tyr Arg Ala Gly Gin Tyr Leu Ser Val Pro 
20 965 970 975 

Lys Asn Thr Ala Gly Gin Asp Asn Leu Pro Asp Arg lie Pro Ser Met 

980 985 990 

His Ser Phe Leu Val Lys Met Gin Asn Gly Ala Ser Cys Thr Leu Unk 
995 1000 1005- 

25 lie Leu Tyr Asp Lys Leu Leu Lys Asn Thr Thr Val Asn Asn Gly Asn 
1010 1015 1020 

Gly Thr Gin lie Thr Trp Arg Ser Gly Asn Ser Gly Ser Ala Asn Met 
1025 1030 1035 1040 

Pro Ser Leu Val Met Asp Val Leu Gly Asn Glu Ser Ala Asp Arg Leu 
30 1045 1050 1055 

Trp lie Phe Thr Asp Gly Gly Leu Ser Phe Gly Phe Asp Asn Gly Trp 

1060 1065 1070 

Asp Gly Arg Lys Leu Thr Glu Lys Gly Leu Ser Gin Leu Tyr Ala Met 
1075 1080 1085 

35 Ser Asp He Gly Asn Asp Lys Phe Gin Val Ala Gly Val Pro Glu Leu 
1090 1095 1100 
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Asn Asn Leu Leu lie Gly Phe Asp Ala Asp Lys Asp Gly Gin Tyr Thr 
1105 1110 1115 1120 

Leu Glu Phe Ala Leu Ser Asp His Phe Ala Lys Gly Ala Val Tyr Leu 
1125 1130 1135 

5 His Asp Leu Gin Ser Gly Ala Lys His Arg lie Thr Asn Ser Thr Ser 
1140 1145 1150 

Tyr Ser Phe Asp Ala Lys Arg Gly Asp Ser Gly Ala Arg Phe Arg Leu 

1155 1160 1165 

Ser Tyr Gly Cys Asp Glu Asn Val Asp Asp Ser His Val Val Ser Thr 
10 1170 1175 1180 

Asn Gly Arg Glu lie He He Leu Asn Gin Asp Ala Leu Asp Cys Thr 
1185 1190 1195 1200 

Val Thr Leu Phe Thr He Glu Gly Lys Leu Leu Arg Arg Leu Lys Val 
1205 1210 1215 

15 Leu Ala Gly His Arg Glu Val Met Lys Val Gin Thr Gly Gly Ala Tyr 
1220 1225 1230 

He Val His Leu Gin Asn Ala Phe Thr Asn Asp Val His Lys Val Leu 

1235 1240 1245 

Val Glu Tyr 
20 1250 



(2) INFORMATION FOR SEQ ID NO: 83 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 426 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

30 

(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Porphyromonas gingivalis 

35 

(ix) FEATURE: 



4 



181 



(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1. . . 426 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 

5 

Thr Met Glu Val Lys Lys Asn Thr Val Val Leu Arg Leu Leu lie Trp 
15 10 15 

Phe Val Ala lie Leu Leu Phe His Ser Ser Arg Leu Trp Gly Gin Glu 
20 25 30 

10 Gly Glu Gly Ser Ala Arg Tyr Arg Phe Lys Gly Phe Val Asp Thr Tyr 
35 40 45 

His Ala Val Arg Ser Ser Ser Pro Phe Asp Phe Met Ser Ser Arg Thr 

50 55 60 

Arg Val Arg Gly Glu Leu Glu Arg Ser Phe Gly Asn Ser Lys Val Ala 
15 65 70 75 80 

Val Ser Val Asn Ala Thr Tyr Asn Ala Leu Leu Lys Asp Glu Thr Gly 

85 90 95 

Leu Arg Leu Arg Glu Ala Phe Phe Glu His Gin Glu Glu His Trp Gly 
100 105 110 

20 Leu Arg Leu Gly Arg Gin lie Val lie Trp Gly Ala Ala Asp Gly Val 
115 120 125 

Arg lie Thr Asp Leu lie Ser Pro Met Asp Met Thr Glu Phe Leu Ala 

130 135 140 

Gin Asp Tyr Asp Asp lie Arg Met Pro Val Asn Ala Leu Arg Phe Ser 
25 145 150 155 160 

Val Phe Asn Glu Ser Met Lys Val Glu Val Val Val Leu Pro Val Phe 

165 170 175 

Glu Gly Tyr Arg Leu Pro Val Asp Pro Arg Asn Pro Trp Asn lie Phe 
180 185 190 

30 Ser Leu Ser Pro lie Ala Gin Gly Met Asn lie Val Trp Lys Glu Glu 
195 200 205 

Ala Gly Lys Pro Ala Phe Lys Val Ala Asn lie Glu Tyr Gly Ala Arg 

210 215 220 

Trp Ser Thr Thr Leu Ser Gly lie Asp Phe Ala Leu Ala Ala Leu His 
35 225 230 235 240 

Thr Trp Asn Lys Met Pro Val lie Glu Val Gin Gly lie Val Pro Thr 
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245 250 255 

Glu lie lie Val Ser Pro Arg Tyr Tyr Arg Met Gly Phe Val Gly Gly 

260 265 270 

Asp Leu Ser Val Pro Val Gly Gin Phe Val Phe Arg Gly Glu Ala Ala 
5 275 280 285 

Phe Asn lie Asp Lys His Phe Thr Tyr Lys Ser His Ala Glu Gin Glu 

290 295 300 

Gly Phe Gin Thr lie Asn Trp Leu Ala Gly Ala Asp Trp Tyr Ala Pro 
305 310 315 320 

10 Gly Glu Trp Met lie Ser Gly Gin Phe Ser Met Glu Ser lie Phe Arg 

325 330 335 

Tyr Arg Asp Phe lie Ser Gin Arg Gin His Ser Thr Leu lie Thr Leu 

340 345 350 

Asn Val Ser Lys Lys Phe Phe Gly Ser Thr Leu Gin Leu Ser Asp Phe 
15 355 360 365 

Thr Tyr Tyr Asp Leu Thr Gly Lys Gly Trp Phe Ser Arg Phe Ala Ala 

370 375 380 

Asp Tyr Ala Leu Asn Asp Gin lie His Leu Met Ala Gly Tyr Asp Trp 
385 390 395 400 

20 Phe Ser Ser Lys Gly Ser Gly lie Phe Asp Arg Tyr Lys Asp Asn Ser 

405 410 415 

Glu Leu Trp Phe Lys Ala Arg Tyr Ser Phe 
420 425 

25 (2) INFORMATION FOR SEQ ID NO: 84 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 amino acids 

(B) TYPE: amino acid 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



35 



(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 
5 (B) LOCATION 1 . . .252 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84 

Arg Gly Ser Ser Ser Gly lie Ser Ala Arg Gly Arg Asp Met Arg Ser 
10 1 5 10 15 

Leu Phe Leu Ser Ala Leu Arg Ser Ser Ser Leu His Gly Ser Glu Arg 

20 25 30 

Arg Ser Arg lie Ser Ser Ser Val Val Met Ser lie Arg Gin Lys lie 
35 40 45 

15 Arg Leu Phe His Leu Ser Val Cys Ala Gin Thr His Asp His Leu lie 
50 55 60 

Glu lie His Leu Val Cys lie Glu Phe Gly Ala lie Asp Thr Asp Glu 
65 70 75 80 

Phe Arg Leu Ser Ser His Ala Tyr Thr Thr Ser Pro Thr His Thr Gly 
20 85 90 95 

Ala lie His His Asn Cys lie Glu Arg Ser Tyr Gly Arg Tyr Leu Val 

100 105 110 

Thr Phe Gly Gin Glu Arg Asn Glu Leu His His His Ser Arg Pro Asp 
115 120 125 - 

25 Arg Asn Ala Glu Val Tyr Arg Phe Pro Phe Asp Asn Ala Phe His Ser 
130 135 140 

lie Arg Tyr Glu Ala Phe Arg Pro lie Arg Pro lie Val Cys His Asp 
145 150 155 160 

Asp His Phe lie Ala lie Gly Ser His Leu Phe Phe Lys Asp Asn Gin 
30 165 170 175 

lie Phe Ser Ser Gly Ser Gin Tyr Asp Asn Tyr Thr Val Ala Cys Phe 

180 185 190 

Val Glu Ser Leu His Asp Arg Glu Gin Arg Ser His Thr His Thr Ala 
195 200 205 

35 Ser Gly Thr Asn His Cys Ala Asp Leu Phe Asp Met Arg Thr Leu Ser 
210 215 220 
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5 



25 



Gin Arg Thr Tyr His lie Arg Asp lie Val Ala Asp Phe Glu Phe Gly 

225 230 235 240 

Gin Phe Leu Gly Arg Phe Ala His Gly Leu Asn His 
245 250 

(2) INFORMATION FOR SEQ ID NO: 85 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 66 amino acids 
10 { B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

15 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

20 (ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1. . .266 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85 



lie lie Tyr Arg Ser Thr Met Lys Leu Leu Leu Tyr Leu Leu Leu Val 
15 10 15 

Leu Ser Thr Leu Ser Pro Met Tyr Ser Gin Met Leu Phe Ser Glu Asn 
20 25 30 

30 Leu Thr Met Asn lie Asp Ser Thr Lys Thr lie Gin Gly Thr lie Leu 
35 40 45 

Pro Val Leu Asp Phe Lys Thr Glu Lys Glu Asn Val Phe Thr Phe Lys 

50 55 60 

Asn Thr Ala Asn Leu Asn Leu Leu lie Lys His Gly Gin Val lie Asn 
35 65 70 75 80 

Leu lie Asn Lys Leu Glu Phe Ser Thr Tyr Gly Asn Lys Val Thr Val 
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85 90 95 

Ser Gly Gly Tyr Val His Thr Glu Tyr Arg Tyr Leu Leu His His Val 

100 105 110 

Phe Glu Val Tyr Pro Tyr Val Glu Ser Gin Trp Ala Glu Ser Arg Gly 
5 115 120 125 

Met Lys Tyr Lys Val Ser Thr Gly Leu Gin Ser Arg Tyr Arg Leu Val 

130 135 140 

Asn Ser Asp Asn Cys Leu Met Phe Ala Thr Leu Gly Val Phe Phe Glu 
145 150 155 160 

10 Phe Glu Lys Trp Glu Gin Pro Ala Thr Ser Leu Phe Ala Gly Thr Tyr 

165 170 175 

Ala Tyr Ser Arg Ser lie Lys Ser His Leu Ser lie Ser Phe Arg His 

180 185 190 

Arg Leu Gly Glu His Trp Glu Phe Thr Thr Thr Ala lie His Gin Gly 
15 195 200 205 

Lys Pro Asp Ser Tyr Phe Lys Lys Ala Arg Phe Gly Gly Ala lie Asp 

210 215 220 

Leu Lys Tyr His lie Thr Pro Thr lie Gly He Arg Gly Ala Tyr Arg 
225 230 235 240 

20 He He Tyr Asp Thr Ala Pro He Val Pro Val Arg Lys Asp Tyr Asn 

245 250 255 

Thr Val Asp Val Gly He Asp He Ser Phe 
260 265 

25 (2) INFORMATION FOR SEQ ID NO: 86 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 9 amino acids 

(B) TYPE: amino acid 
30 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
<iii) HYPOTHETICAL: YES 
(vi) ORIGINAL SOURCE: 



35 
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(A) ORGANISM: Porphyromonas gingivalis 



(ix) FEATURE: 

(A) NAME/KEY: mis c_f eature 
5 (B) LOCATION 1...329 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86 



Glu Thr Asn 
10 1 

Thr Asn Arg 

Leu Leu Ala 
35 

15 Ala Thr Asp 
50 

Arg lie His 
65 

Asp Thr Tyr 

20 

Leu Gly Glu 

lie Arg Thr 
115 

25 Asn Asn Ala 
130 

Leu Tyr Arg 
145 

Leu Arg Leu 

30 

Tyr Ser Thr 

Asn Ala lie 
195 

35 Phe Pro Leu 
210 



Ser Trp Val Ser 
5 

Arg Tyr Ala Phe 
20 

Trp Gly Gin Asp 

Ser Ser Ser Arg 
55 

Ser Ala Tyr Met 
70 

Leu Ser Pro Leu 
8 5 

Lys Thr Phe Pro 
100 

Gly His Glu Leu 

His Phe Tyr Ser 
135 

Leu Gly Ala Lys 
150 

Ala Phe Gly Pro 
165 

Arg Asn Gly Asn 
180 

Ala Gin Ala Ser 

Tyr Phe Arg Leu 
215 



Ser Asp Cys Asn 
10 

Val Leu Pro Leu 
25 

Ser Ser His Gly 
40 

Glu Leu Pro Thr 

Val Gly Gly Gly 
75 

Arg Tyr Gly Gly 
90 

Leu Lys Ala Ser 
105 

Asp Phe Ala Leu 
120 

Leu Leu Tyr Asn 

His Leu Arg Ala 
155 

Gly Leu Glu lie 
170 

Asn Pro Ala Thr 
185 

lie Gly Tyr Tyr 
200 

Leu Ser Gin lie 



Ser Thr Thr Met Lys 
15 

Leu Leu Leu Thr Gly 
30 

Ser Asn Thr Ala Phe 
45 

Glu Gin Ser Ala Tyr 
60 

Gly Ser lie Thr Arg 
80 

Trp Thr Leu Asn Leu 
95 

Asp Ser Arg Trp Met 
110 

Met Asp Asn Pro Ala 

125 " 
Gly Ser Ala Ala Ala 
140 

Ala Trp Met Asp Asn 
160 

Gly Leu Gly Gly lie 
175 

Leu Lys Leu Tyr Thr 
190 

Val Pro Ser Glu Thr 
205 

Asn Leu Phe Gly lie 
220 
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Ala Tyr Gly Asn Gly Phe Gly Glu Ser Tyr Tyr Glu Asn Phe Leu Leu 
225 230 235 240 

Asn Asn Gly lie Ala Gly Ser Leu His Phe Thr Tyr Pro Gly Lys Phe 

245 250 255 

Thr Arg Phe Thr Thr Leu lie Thr Ala Asp lie Pro lie Arg Asn Phe 

260 265 270 

Cys Thr Leu Arg Val Gly Tyr Arg Tyr Ser His Leu Gly Ser Ser Leu 

275 280 285 

Asn Ala Leu Asp Thr Arg lie His Ser His Thr Ala Phe lie Gly Phe 

290 295 300 

Val Thr Glu Phe Tyr Arg Phe Arg Gly Arg Lys Ala Met Asn Thr Gly 
305 310 315 320 

Arg Arg Thr Ser Leu Tyr Tyr His Asp 
325 

(2) INFORMATION FOR SEQ ID NO: 87 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

<ix) FEATURE: 

(A) NAME/ KEY: misc_feature 

(B) LOCATION 1 . . . 814 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87 



Pro Arg Phe His Pro Gly Gly Thr His Ala Gin Leu Cys Arg Asn Gly 
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1 

Arg Arg Asn Gin 
20 

Ala Pro Arg Phe 
35 

Ala Leu Val Val 
50 

Glu Glu Gly He 
65 

Ala Val Glu Thr 

Tyr Ser Val Gly 
100 

He Ser Gly Leu 
115 

Gin Leu Lys Ser 
130 

Leu Asp Glu Ala 
145 

He Asp Leu He 

Arg Ala Tyr Leu 
180 

Glu Ala Arg He 
195 

Val Ala Lys Leu 
210 

Leu Phe He Gly 
225 

Thr Ala Leu Ser 

He Val Gly Glu 
260 

Val Ala Ala Asn 
275 

Ala Asp Ser Arg 



5 

Lys Ser Asn Gin 

Phe He Met Cys 
40 

Cys Leu Leu Phe 
55 

Trp Asn Thr Leu 
70 

Pro Lys Lys Val 
85 

Lys Glu Ala Pro 

Ser Asp Thr Ser 
120 

Leu Val He Tyr 
135 

Gly Arg Val Thr 
150 

Asp Lys Thr Leu 
165 

Ala Gly Gly Phe 

Pro Ala Thr Tyr 
200 

Asp Asn Asp Arg 
215 

Lys Glu Thr Asp 
230 

Leu Asn Leu Pro 
245 

Asp He Cys Phe 

Gin Ser Phe Glu 
280 

Leu Tyr Val Thr 



10 

Ser Ser Cys Phe 
25 

Lys He Arg Phe 

Thr Ser Phe Ser 
60 

Leu Ala He His 
75 

Phe Ala Val Ala 
90 

His Glu Ala Lys 
105 

Val Ser Ser He 

Tyr Ala Ser Gly 
140 

Asn Val Pro Ala 
155 

Asn Arg Leu Leu 
170 

Gly Leu Ser Val 
185 

Ala Lys Gly Thr 

Leu Leu Met Leu 
220 

Asn Leu Gin Asp 
235 

Met Gly Ser Val 
250 

Leu Leu Ala Asp 
265 

Pro Glu Leu Leu 
Asp Arg Gly Leu 



15 

Gly Gly Gly Thr 
30 

Ser Leu Leu Gin 
45 

Leu Gin Ala Gin 

Lys Thr Glu Lys 
80 

Asn Gly Val Leu 
95 

He Phe Asp Arg 
110 

Ala Tyr Ser Glu 
125 

Asn He Asp He 

Leu Lys Asp Asn 
160 

He Val Gly Asn 
175 

Leu Asp Val Ala 
190 

Lys Val Thr Asp 
205 

Lys Glu Gly Gin 

Pro Ala Ala Trp 
240 

Thr Gly Leu Gly 
255 

Gly Arg Val Tyr 
270 

Leu Ser Ser Ser 
285 

Phe He Cys Ala 
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290 295 300 

Glu Asn Arg lie Tyr Phe lie Glu Lys Gly Arg Lys Thr Thr Gin Phe 
305 310 315 320 

Pro lie Ala Asp Val Leu Gly Val Gly Ala Met Asn Glu Ser Asn Thr 
5 325 330 335 

Ala Tyr lie Ala Leu Gly Glu Glu Gly Leu Ala Ser Leu Leu Leu Ala 

340 345 350 

Glu Gly Ser Thr Ala Glu Ala Met Pro Val Ala Phe Asp Gly Pro Gly 
355 360 365 

10 Asp Asn Asp Phe Tyr Glu Met Arg Phe Ser His Gly Arg Leu Tyr Ala 
370 375 380 

Ala Ser Gly Leu Trp Gly Thr Asn Leu Met Gly His Ala Gly Met Val 
385 390 395 400 

Lys Leu Tyr Asp Gly Asn Arg Trp Thr Asn Phe Asp Lys Lys Thr Val 
15 405 410 415 

Gin Glu Gin Leu Gly Gly Gly Phe Ser Phe Asn Asp Ala lie Asp lie 

420 425 430 

Ala Val Ser Asn Gly Asp Pro Asp His Phe Phe Val Gly Thr Trp Gly 
435 440 445 

20 Asn Gly Leu Phe Glu Phe Lys Asp Gly Lys Ala lie Ala Arg Tyr Ser 
450 455 460 

Gly Asn Glu Thr Ala lie Ala Glu Cys Asn Pro Gly Asp Ala Arg Val 
465 470 475 480 

Lys Ala lie Ala Phe Asp Asn Lys Gly Asn Leu Trp Gly Ttir Leu Gly 
25 485 490 495 

Ala Val Gly Lys Asn lie Phe Met Tyr Asp Pro Gin Ser Ser Thr Trp 

500 505 510 

His Ser Phe Ser Tyr Pro Asp Val Ala Asn Leu Ala Ser Phe Gly Asn 
515 520 525 

30 Met lie lie Leu Pro Asn Gly Asp Lys Trp Val Asn lie Leu His Arg 
530 535 540 

Ser Gly Gly Ser Thr Arg Lys Gly Val Leu lie Phe Asn Asp Arg Gly 
545 550 555 560 

Thr Pro Glu Thr Thr Ser Asp Asp Ser His Leu Tyr Val Glu Gin Phe 
35 565 570 575 

Val Asn Arg Leu Gly Ala Ala lie Gly His Lys Thr He Tyr Ala Met 
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580 585 590 

Ala Val Asp His Asn Gly Ser Val Trp Met Gly Ser Asp lie Gly lie 

595 600 605 

Phe Gly Val Tyr Asn Ala Ala Gly Val Leu Ser Ser Thr Ser Thr Pro 
5 610 615 620 

lie Ala Val Arg Pro Val Gly Gly Glu Glu Pro Asn Leu Tyr Tyr Val 
625 630 635 640 

Leu Asp Lys Val Thr Val Thr Asp lie Val Val Asp Lys Leu Asn His 
645 650 655 

10 Lys Trp Val Ala Thr Gin Gly Thr Gly Leu Tyr Leu Leu Ser Glu Asp 
660 665 670 

Cys Ser Lys lie Leu Ala Gin Phe Thr Val Glu Asn Ser Pro Leu Leu 

675 680 685 

Ser Asn Asn lie Leu Ser Leu Ala Leu Asn Asp Asp Asn Gly Leu Leu 
15 690 695 700 

Tyr lie Gly Thr Ala Asp Gly Leu Met Thr Phe Gin Thr Gly Thr Gly 
705 710 715 720 

Ser Gly Ser Ala Ser Glu Leu Asp Gly Val Tyr Val Tyr Pro Asn Pro 
725 730 735 

20 Leu Arg Pro Glu Tyr Pro Asp Gly Val Thr lie Ala Gly Leu Gin Ala 
740 745 750 

Gly Cys Ser Val Lys lie Thr Asp Thr Thr Gly Arg Leu Leu Tyr Gin 

755 760 765 

Thr Glu Ser Val Thr Thr Glu Val Lys Trp Asn Ala Arg Gly Ala Asp 
25 770 775 780 

Gly Asn Arg Val Ala Ser Gly Val Tyr Ala Val Ala Val Tyr Asp Pro 
785 790 795 800 

Val Ser Lys Lys Ser Lys Leu lie Arg Phe Ala Val lie Arg 
805 810 

30 

(2) INFORMATION FOR SEQ ID NO: 88 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 451 amino acids 
35 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 
10 (A) NAME/KEY: misc_f eature 

(B) LOCATION 1. . .451 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88 

15 Lys Thr Ser Tyr Arg Asn Asn Met Arg Phe Gin His Tyr Leu lie Cys 
15 10 15 

Thr Ala Ala Val Ala Ala Leu Ala Ala Asn Pro Leu Thr Gly Gin Ser 

20 25 30 

Asn Met Thr Leu Glu Glu Cys lie Asp Tyr Ala Arg Arg His Ser Ser 
20 35 40 45 

Ala Val Ala Leu Ser Ala Ala Glu Leu Glu Gin Ser Lys Ala Asp Tyr 

50 55 60 

Leu Gin Ala Val Gly Asn Phe Leu Pro Arg Val Ser Ala Gly Thr Gly 
65 70 75 80 

25 Ala Ser Trp Asn Phe Gly Arg Gly Leu Asp Ala Glu Thr Asn Thr Tyr 

85 90 95 

Thr Asp lie Asn Ser Phe Asn Asn Ser Tyr Ser lie His Ala Thr Met 

100 105 110 

Thr Leu Phe Asp Gly Leu Gin Ser Val Tyr Arg Leu Arg Met Ala His 
30 115 120 125 

Ala Arg Arg Glu Ala Ser Arg Leu Ser Val Arg Glu Gin Gin Glu Leu 

130 135 140 

Ala Ala Leu Gly Thr Thr Glu Ala Tyr Tyr Asp Leu Val Tyr Ala Arg 
145 150 155 160 

35 Gin Met Gin Glu Leu Ala Met Gin Lys Tyr Glu Glu Ser Ser Arg Leu 

165 170 175 
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His Arg Gin Thr Ala Arg Met Glu Glu Leu Gly Met Lys Ser Arg Pro 

180 185 190 

Asp Val Leu Glu Met Gin Ser Arg Met Ala Gly Asp Arg Leu Ala Leu 

195 200 205 

Thr Gin Ala Asp Asn Gin Cys He He Ala Leu He Arg Leu Lys Glu 

210 215 220 

Lys Met Asn Phe Pro He Asp Asp Glu Leu Val Val Asp Asp Met Pro 

■j-an 235 240 

225 230 

Ala Asp Ser Leu Ser Ala Asp Met Ala Glu Ser Asp Ser Ser Ala Gly 

245 250 255 

Val Phe Ala Arg Ala Ala His His His Pro Val Leu Leu Arg Ala Lys 

260 265 270 

Leu Asp Glu Gin Ala Ala Thr Asp Arg Leu Arg Ala Ala Arg Gly Ala 

275 280 285 

Phe Leu Pro Ser Val Ser Val Ser Gly Gly Trp Asn Thr Gly Phe Ser 

290 295 300 

Arg Phe Leu Asn Gly Ser Asp Tyr Thr Pro Phe Ser Glu Gin Phe Arg 
305 310 315 320 

Asn Arg Arg Gly Glu Tyr Val Ser Leu Asn Leu Ser He Pro He Phe 

325 330 335 

Ser Gly Phe Ser Leu Val Ser His Leu Arg Gin Ala Arg Ala Glu Arg 

340 345 350 

Arg Ala Ala He Val Arg Arg Gly Glu Ala Glu Arg Arg Leu Tyr Ser 

355 360 365 

Glu He Ala Gin Ala Met Ala Asp Arg Asp Ala Ala Leu Ala Ser Tyr 

370 375 380 

Arg Gin Ala Lys Glu His Thr Asp Ala Met Gin Thr Ala Tyr Glu Ala 
385 390 395 400 

Val Leu Gin Arg Tyr Glu Glu Gly Leu Asn Thr Ala He Asp Leu Thr 

405 410 415 

Thr Gin Ala Asn Arg Leu Leu Asp Ala Arg Val Gin Arg Leu Arg Ala 

420 425 430 

Ala Met Thr Tyr Arg Leu Lys Cys Lys Leu He Ala Tyr Tyr Gly Cys 
435 440 445 

Leu Ser Asp 
450 
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10 



15 



(2) INFORMATION FOR SEQ ID NO: 89 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

( iii ) HYPOTHETICAL : YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival! s 

(ix) FEATURE: 

(A) NAME / KEY : misc_feature 

(B) LOCATION 1. . .834 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 9 

Met Lys Arg lie Val Leu Ser Ser Phe Leu Phe Val Leu Ser lie Leu 

15 10 15 

Ser Leu Met Ala Gin Asn Asn Thr Leu Asp Val His lie S^r Gly Thr 

25 20 25 30 

lie Lys Asp Ala Ser Ser Gly Glu Pro Val Pro Tyr Ala Thr Val Ser 

35 40 45 

lie Arg Leu Thr Gly Ala Asp Thr Thr Gin Val Phe Arg Gin Val Thr 

50 55 60 

30 Asp Gly Asn Gly Tyr Phe Val lie Gly Leu Pro Ala Ala Pro Ser Tyr 
65 70 75 80 

His Leu Thr Ala Ser Phe Val Gly Met Lys Thr His Thr Met Gin lie 

85 90 95 

Ser Arg Gly Asn Gly Gin His Asp lie Lys Ser lie Asp lie Ser Leu 

35 100 105 110 

Glu Ser Glu Asp Lys Gin Leu Ser Thr Val Thr Val Ser Ala Ala Arg 
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115 120 125 

Pro Leu Val Lys Met Glu lie Asp Arg Leu Ser Tyr Asn Met Lys Asp 

130 135 140 

Asp Pro Ala Ala Lys Thr Asn Asn Leu Leu Glu Met Leu Arg Asn Val 
5 145 150 155 160 

Pro Leu Val Thr Val Asp Gly Gin Gly Asn lie Gin Val Lys Gly Ser 

165 170 175 

Ser Asn Phe Lys lie His Leu Asn Gly Arg Pro Ser Thr Met Val Ser 
180 185 190 

10 Ser Asn Pro Lys Glu Val Phe Arg Ser lie Pro Ala His Thr lie Lys 
195 200 205 

Arg Val Glu Val lie Thr Asp Pro Gly Val Lys Tyr Asp Ala Glu Gly 

210 215 220 

Thr Ser Ala lie Leu Asp lie Val Thr Glu Glu Gly Lys Lys Leu Glu 
15 225 230 235 240 

Gly Tyr Ser Gly Ser lie Thr Ala Ser Val Ser Asn Asn Pro Thr Ala 

245 250 255 

Asn Gly Ser lie Phe Leu Thr Ala Lys Ser Gly Lys Val Gly Leu Thr 
260 265 270 

20 Thr Asn Tyr Asn Tyr Tyr Gly Gly Lys Asn Lys Gly Ser Arg Tyr Phe 
275 280 285 

Thr Glu Arg Thr Thr Ser Met Leu Gin Thr lie Glu Glu Gly Lys Gly 

290 295 300 

Gin Glu Thr Phe Gly Gly His Phe Gly Asn Ala Leu Leu Ser Phe Glu 
25 305 310 315 320 

lie Asp Ser Leu Asn Leu Phe Thr Val Gly Gly Asn Val Arg Leu Trp 

325 330 335 

Glu Met Thr Thr Asp Arg Asn Ser Val Glu Lys Ser Phe Ala Gly Ser 
340 345 350 

30 Asn Leu Met Ser Tyr lie Asp Arg Lys Leu Lys Thr Gin Met Asp Ala 
355 360 365 

Gly Ser Tyr Glu Leu Asn Ala Asp Tyr Gin His Ser Thr Arg Leu Pro 

370 375 380 

Gly Glu Leu Leu Thr Val Ser Tyr Arg Phe Thr His Asn Pro Asn Asn 
35 385 390 395 400 

Ser Glu Thr Phe lie Asp Gin Trp Lys Arg Asp Pro Leu Asn Thr Ala 
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Asn Thr 
Asp Glu 

5 

His Ser 
450 
Asp Pro 
465 

10 Gly Ser 
Gin Tyr 
Tyr Ser 

15 

Leu Phe 
530 
Trp Val 
545 

20 Leu Lys 
Leu Asn 
Asn Pro 

25 

Asn Gin 
610 
Phe Cys 
625 

30 Pro Asn 
Phe Ser 
lie Met 

35 

Leu Gly 



405 

lie Gin Tyr 

420 
His Thr Ala 
435 

Leu Glu Ala 

Leu Tyr Glu 

Leu Tyr Ala 
485 

lie Gly Ala 

500 
Leu Gin Thr 
515 

Pro Glu Asn 

Pro Gin Leu 

Leu Ala Tyr 
565 

Pro Tyr Arg 

580 
Asp Leu Lys 
595 

Tyr Gly Ala 

Asn Asn Ala 

Leu Phe His 
645 

Leu Asn Thr 
660 

Leu Asn Gly 
675 

lie Asp Val 



Ala Gly Gin 

Gin Val Asp 
440 

Gly Leu Lys 

455 

lie Arg Pro 
470 

Gin Asn Pro 

Ala Tyr Ala 

Gly Leu Arg 
520 

Ala Ala Ala 

535 
Thr Leu Gly 
550 

Asn Phe Arg 

Leu Gin Thr 

Ser Glu Lys 
600 

Lys Val Met 

615 
lie Gin Asn 
630 

Gin Thr Tyr 

Tyr Ala Met 

Asn lie Asp 
680 

Asn Ser Trp 



410 
His Ser Lys 
425 

Tyr Thr Arg 

Tyr lie Tyr 

Ser Glu Asp 
475 

Ser Asn Gly 

490 
Gly Tyr Asn 
505 

Val Glu Ser 

Asp Phe Ser 

Tyr Thr Pro 
555 

lie Gin Arg 

570 
Asn Asp Unk 
585 

Arg His His 

Leu Thr Ala 

Tyr Thr Phe 
635 

Gly Asn lie 

650 
Tyr Thr Pro 
665 

Arg Thr Phe 
Ser Gly Met 



415 

Ser Asp Ala Gly Met 
430 

Pro Leu Gly Gin Ala 
445 

Arg His Ala Thr Ser 
460 

Ala Pro Trp Gin Pro 
480 

Lys Phe Arg His Asp 
495 

Tyr Arg Lys Asp Gin 
510 

Ser Arg Leu Lys Ala 

525 

His Asn Ser Phe Asp 
540 

Ser Pro Met Lys Gin 
560 

Pro Ala lie Gly Gin 
575 

Gin Val Gin Tyr Gly 
590 

Val Gly Leu Ser Tyr 
605 

Ser Leu Asp Tyr Asp 
620 

Ser Asp Pro Ala Asn 
640 

Gly Arg Glu His Ser 
655 

Ala Val Trp Val Arg 
670 

Gin Lys Ser Glu Ala 
685 

Val Tyr Ser Gly Leu 
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690 695 700 

Met Phe Thr Leu Pro Lys Asp Trp Thr Val Asn Leu Phe Gly Gly Tyr 
705 710 715 720 

Tyr His Gly Gly Arg Ser Tyr Gin Thr Lys Tyr Asp Gly Asn Val Phe 

725 730 735 

Asn Asn lie Gly lie Ala Lys Gin Leu Phe Asp Lys Lys Leu Arg Val 

740 745 750 

Ser Leu Ser Ala Asn Asn lie His Ala Lys Tyr Ser Thr Trp Lys Ser 

755 760 765 

Arg Thr lie Gly Asn Gly Phe Thr He Tyr Ser Glu Asn Ala Gly He 

770 775 780 

Gin Arg Ser Val Ser Leu Ser Leu Thr Tyr Ser Phe Gly Lys Met Asn 
785 790 795 800 

Thr Gin Val Arg Lys Val Glu Arg Thr He Val Asn Asp Asp Leu Lys 

805 810 815 

Gin Thr Ser Ser Gin Gly Gin Gin Gly Gly Gly Gin Gly Asn Pro Thr 
820 825 830 

Gly Asn 



(2) INFORMATION FOR SEQ ID NO: 90 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 93 amino acids 
{ B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



(ix) FEATURE: 

(A) NAME/ KEY: misc_f eature 
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(B) LOCATION 1. . . 593 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90 

5 Met Asn Ser lie Met Lys Tyr Gin Leu Tyr Thr Ala Val lie Met Ala 
15 10 15 

Leu Ser Val Ser Ser Val Cys Gly Gin Thr Pro Arg Asn Thr Glu Thr 

20 25 30 

Lys Arg Pro Asp Thr Leu Arg Arg Glu Leu Thr lie Val Asn Asp Gin 
10 35 40 45 

Thr Val Glu Met Glu His Ala Asp Pro Leu Pro Ala Ala Tyr Lys Ala 

50 55 60 

lie Glu Pro Arg Leu Lys Pro Phe Arg Pro Glu Tyr Asn Lys Arg Thr 
65 70 75 80 

15 Phe Gly Phe Val Pro Glu Val Ser Ser Ser Gly Arg Asn Asn Leu Pro 

85 90 95 

Asn lie Leu Pro Thr Glu Gly His Met Lys His Arg Gly Tyr Leu Asn 

100 105 110 

lie Gly lie Gly His Thr Leu Asn Gin Arg Met Asp Ala Gly Tyr Arg 
20 115 120 125 

Leu lie Asp Ala Glu Gin Glu Arg Leu Asn Leu Phe Leu Ser Tyr Arg 

130 135 140 

Gly Met Lys Ser Ala Phe Asn Thr Gly Asp Phe Asp Gly Asp Arg Lys 
145 150 155 - 160 

25 Asp Arg Arg Met Met Ala Gly Val Asp Tyr Glu Gin Arg Arg Pro Ser 

165 170 175 

Phe Val Leu Ala Thr Gly Leu Tyr Tyr Ser Asn His Tyr Phe Asn Asn 

180 185 190 

Tyr Gly Arg Gly Ala Thr Thr Asn Val Gly Ser lie Pro Gin Leu Ser 
30 195 200 205 

Thr Pro Val Thr Pro Gin Met Asp Asn Gly Thr His Asn Val Arg Val 

210 215 220 

Tyr Leu Gly Ala Lys Asn Asp Val lie Asp Ala Arg lie Asp Tyr Arg 
225 230 235 240 

35 Phe Phe Arg Ser lie Pro Tyr Leu Gly Thr Asp Pro Met Lys Ala Leu 

245 250 255 



» 



Thr Glu 

Asp Asp 

5 Ala Lys 
290 
Arg Asn 
305 

Asp Ser 

10 

Ser His 

Ala Ser 

15 Gly Gly 
370 
Pro Tyr 
385 

Thr Ala 

20 

Glu Val 

Pro Thr 

25 Ser Phe 
450 
Lys Leu 
465 

Ser Tyr 

30 

Pro Asp 

Leu Asp 

35 Ser Phe 
530 



His Thr Pro Glu 
260 

lie Lys Leu Gly 
275 

Asn Ser Glu Met 

Leu Tyr Tyr Val 
310 

Asp Asn Met Gin 
325 

Phe Gly Ala Lys 
340 

Leu Ser lie Phe 
355 

Val lie Arg Asn 

Leu Met Pro Asn 
390 

Gin Leu Gly Val 
405 

Tyr Gly Asp Phe 
420 

Leu Pro Leu Tyr 
435 

Leu Pro lie Tyr 

Glu Tyr Ser Tyr 
470 

Gly Lys Trp Asn 
485 

Leu lie Leu Lys 
500 

Val Arg Leu Arg 
515 

Gly Ser Ala Gly 



198 

Leu Asn Val Thr 
265 

Val Glu Val Arg 
280 

lie Gin Thr Gly 
295 

Glu Gly Ala Pro 

Trp Asn lie Gin 
330 

Gly Arg Leu Phe 
345 

Pro Ser Trp Arg 
360 

Gly Leu Ala Asp 
375 

Thr lie Val Leu 

Lys Gly Asn lie 
410 

Ser Lys Leu Thr 
425 

Asn Pro Ser Asp 
440 

Ala Asp Gly Ser 
455 

Arg Asp Met Leu 

Leu Asp Gly Gly 
490 

Ala Glu Val Gly 
505 

Tyr Thr Gin Leu 
520 

Ser Glu Ala Leu 
535 



Met Ser Asn Glu 
270 

Thr Gly Gly Leu 
285 

Val Leu Ser Glu 
300 

Thr lie Gly Phe 
315 

Ala Gly Val Gly 

Phe Trp Pro Lys 
350 

Val Tyr Ala Lys 
365 

Val Met Gin Glu 
380 

Pro Ser Arg Asn 
395 

Ala Asp Val Val 

Gly Val Pro Phe 
430 

Leu Tyr Gin Tyr 

445 ' 
Arg Trp Arg Ala 
460 

Arg Phe Leu Val 
475 

Leu Val Ala Ser 

Val His Pro lie 
510 

Asn Gly Arg Tyr 
525 

Gly lie Gly Asn 
540 



Leu Ser 

Phe Phe 

Thr Asp 

Val Gly 
320 
lie Ser 
335 

Leu Asp 

Ala Phe 

Glu Met 

Ala Leu 
400 
Arg Met 
415 

Tyr Thr 

Asn Val 

Gly Gly 

Asp Ala 
480 
Met Gin 
495 

Ala Pro 
Arg Tyr 
Val His 
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Leu Leu Ser Ala Asp Val Ser Tyr Lys Leu Lys Lys Asn Leu Ser Leu 
545 550 555 560 

Tyr Leu Lys lie Asp Asn Met Leu Ala Glu Thr Thr Glu Leu lie Gly 

565 570 575 

Tyr Tyr Pro Met Gin Pro Phe His Cys Phe Ala Gly Phe Ser Trp Thr 
580 585 590 

Phe 



10 (2) INFORMATION FOR SEQ ID NO: 91 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 9 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



( iii ) HYPOTHETICAL : YES 

20 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



(ix) FEATURE: 
25 (A) NAME/KEY: miscjeature 

(B) LOCATION 1. . . 589 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91 

30 Met Lys Tyr Gin Leu Tyr Thr Ala Val lie Met Ala Leu Ser Val Ser 
15 10 15 

Ser Val Cys Gly Gin Thr Pro Arg Asn Thr Glu Thr Lys Arg Pro Asp 

20 25 30 

Thr Leu Arg Arg Glu Leu Thr lie Val Asn Asp Gin Thr Val Glu Met 

35 35 40 45 

Glu His Ala Asp Pro Leu Pro Ala Ala Tyr Lys Ala lie Glu Pro Arg 
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50 

Leu Lys Pro 
65 

Pro Glu Val 

5 

Thr Glu Gly 

His Thr Leu 
115 

10 Glu Gin Glu 
130 

Ala Phe Asn 
145 

Met Ala Gly 

15 

Thr Gly Leu 

Ala Thr Thr 
195 

20 Pro Gin Met 
210 
Lys Asn Asp 
225 

lie Pro Tyr 

25 

Pro Glu Leu 

Leu Gly Val 
275 

30 Glu Met lie 
290 
Tyr Val Glu 
305 

Met Gin Trp 

35 

Ala Lys Gly 



55 

Phe Arg Pro Glu 
70 

Ser Ser Ser Gly 
85 

His Met Lys His 
100 

Asn Gin Arg Met 

Arg Leu Asn Leu 
135 

Thr Gly Asp Phe 
150 

Val Asp Tyr Glu 
165 

Tyr Tyr Ser Asn 
180 

Asn Val Gly Ser 

Asp Asn Gly Thr 
215 

Val lie Asp Ala 
230 

Leu Gly Thr Asp 
245 

Asn Val Thr Met 
260 

Glu Val Arg Thr 

Gin Thr Gly Val 
295 

Gly Ala Pro Thr 
310 

Asn lie Gin Ala 
325 

Arg Leu Phe Phe 



Tyr Asn Lys Arg 
75 

Arg Asn Asn Leu 
90 

Arg Gly Tyr Leu 
105 

Asp Ala Gly Tyr 
120 

Phe Leu Ser Tyr 

Asp Gly Asp Arg 
155 

Gin Arg Arg Pro 
170 

His Tyr Phe Asn 
185 

lie Pro Gin Leu 
200 

His Asn Val Arg 

Arg lie Asp Tyr 
235 

Pro Met Lys Ala 
250 

Ser Asn Glu Leu 
265 

Gly Gly Leu Phe 
280 

Leu Ser Glu Thr 

lie Gly Phe Val 
315 

Gly Val Gly lie 
330 

Trp Pro Lys Leu 



60 

Thr Phe Gly Phe Val 
80 

Pro Asn lie Leu Pro 
95 

Asn lie Gly lie Gly 
110 

Arg Leu lie Asp Ala 
125 

Arg Gly Met Lys Ser 
140 

Lys Asp Arg Arg Met 
160 

Ser Phe Val Leu Ala 
175 

Asn Tyr Gly Arg Gly 
190 

Ser Thr Pro Val Thr 
205 

Val Tyr Leu Gly Ala 
220 

Arg Phe Phe Arg Ser 
240 

Leu Thr Glu His Thr 
255 

Ser Asp Asp lie Lys 
270 

Phe Ala Lys Asn Ser 
285 

Asp Arg Asn Leu Tyr 
300 

Gly Asp Ser Asp Asn 
320 

Ser Ser His Phe Gly 
335 

Asp Ala Ser Leu Ser 
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lie Phe Pro 
355 

Arg Asn Gly 
5 370 

Pro Asn Thr 
385 

Gly Val Lys 

10 Asp Phe Ser 

Leu Tyr Asn 
435 

lie Tyr Ala 
15 450 

Ser Tyr Arg 
465 

Trp Asn Leu 

20 Leu Lys Ala 

Leu Arg Tyr 
515 

Ala Gly Ser 
25 530 

Asp Val Ser 
545 

Asp Asn Met 
30 Gin Pro Phe 



340 

Ser Trp Arg Val 

Leu Ala Asp Val 
375 

lie Val Leu Pro 
390 

Gly Asn lie Ala 
405 

Lys Leu Thr Gly 
420 

Pro Ser Asp Leu 

Asp Gly Ser Arg 
455 

Asp Met Leu Arg 
470 

Asp Gly Gly Leu 
485 

Glu Val Gly Val 
500 

Thr Gin Leu Asn 

Glu Ala Leu Gly 
535 

Tyr Lys Leu Lys 
550 

Leu Ala Glu Thr 
565 

His Cys Phe Ala 
580 



345 

Tyr Ala Lys Ala Phe 
360 

Met Gin Glu Glu Met 
380 

Ser Arg Asn Ala Leu 
395 

Asp Val Val Arg Met 
410 

Val Pro Phe Tyr Thr 
425 

Tyr Gin Tyr Asn Val 
440 

Trp Arg Ala Gly Gly 
460 

Phe Leu Val Asp Ala 
475 

Val Ala Ser Met Gin 
490 

His Pro lie Ala Pro 
505 

Gly Arg Tyr Arg Tyr 
520 

lie Gly Asn Val His 
540 

Lys Asn Leu Ser Leu 
555 

Thr Glu Leu lie Gly 
570 

Gly Phe Ser Trp Thr 
585 



350 

Gly Gly Val lie 
365 

Pro Tyr Leu Met 

Thr Ala Gin Leu 
400 

Glu Val Tyr Gly 
415 

Pro Thr Leu Pro 
430 

Ser Phe Leu Pro 
445 

Lys Leu Glu Tyr 

Ser Tyr Gly Lys 
480 

Pro Asp Leu lie 
495 

Leu Asp Val Arg 
510 

Ser Phe Gly Ser 
525 

Leu Leu Ser Ala 

Tyr Leu Lys lie 
560 

Tyr Tyr Pro Met 
575 

Phe 



(2) INFORMATION FOR SEQ ID NO: 92 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 330 amino acids 

* 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

5 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

10 

(ix) FEATURE: 

(A) NAME/ KEY : mis c_f eature 

(B) LOCATION 1 . . . 330 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92 

Met lie Arg Lys His Phe Gly lie lie Leu Gly Phe Leu Ser Leu Val 
15 10 15 

Phe Ser Ala Gly Ala Gin Gin Glu Lys Gin Val Phe His Phe Leu Asn 
20 20 25 30 

Leu Pro Ala Thr Ala Gin Ala Leu Ala Ala Gly Gly Lys Ala lie Thr 

35 40 45 

lie Val Asp Asp Asn Pro Gly Leu Ala Phe Glu Asn Pro Ala Leu Leu 
50 55 60 

25 Gly Tyr Glu Ser Gly Gly Arg Ala Phe Leu Ser Tyr Leu Tyr Tyr Met 
65 70 75 80 

Ser Gly Ser His Met Gly Asn Ala Cys Tyr Ala Ser Ser Val Gly Glu 

85 90 95 

Arg Gly Met Trp Gly Val Gly Met Arg Phe Leu Asn Tyr Gly Ser Met 
30 100 105 110 

Gin Gly Tyr Asp Gin Asn Ala lie Ala Thr Gly Ser Phe Ser Ala Ser 

115 120 125 

Asp lie Ala Val Gin Gly Phe Tyr Ser His Glu Leu Ser Asn His Phe 
130 135 140 

35 Arg Gly Gly Val Ser Leu Lys Ala Leu Tyr Ser Ser lie Glu Thr Tyr 
145 150 155 160 
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Ser Ser Phe Gly Leu Gly Val Asp Val Gly lie Ser Tyr Tyr Asp Asp 

165 170 175 

Asp Lys Gly Tyr Ser Ala Ser Ala Leu Phe Lys Asn Val Gly Ala Gin 
180 185 190 

5 Leu Lys Gly Tyr Asn Glu Glu Arg Glu Pro Leu Asp Trp Asp Phe Gin 
195 200 205 

Leu Gly Phe Ser Arg Ser Phe lie Asn Ala Pro Phe Arg Leu His lie 

210 215 220 

Thr Leu Phe Asn Leu Asn Pro His Tyr Phe Lys Arg Leu Val Pro Arg 
10 225 230 235 240 

Asp Leu Ser Lys Met Gin Lys Phe Leu Arg His Phe Ser lie Gly Ala 

245 250 255 

Glu Phe Thr Pro Ser Glu Arg Phe Trp Val Gly Leu Gly Tyr Thr Pro 
260 265 270 

15 Gin lie Ala Gin Asp Phe Glu Val Glu Gly Gly Asn Lys Trp Gly Gly 
275 280 285 

Leu Ser Ala Gly Val Gly Phe Thr Ser Gly Val Val Arg Val Gly Val 

290 295 300 

Ser Ala Ala Thr Tyr His Pro Ala Ala Leu Ser Phe Met Cys Ser Val 
20 305 310 315 320 

Gly lie Arg Leu Asp Asp Lys Ser lie Phe 
325 330 

(2) INFORMATION FOR SEQ ID NO: 93 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 229 amino acids 

(B) TYPE: amino acid 
( D) TOPOLOGY : linear 

30 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 
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(ix) FEATURE: 

(A) NAME/ KEY: misc_f eature 

(B) LOCATION 1. . .229 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93 

Met Met Lys Lys Ala Phe Val Phe Val Leu Leu Val Cys Leu Phe Ser 
15 10 15 

10 Ser Phe Ser Ser Ser Ala Gin Thr Thr Thr Asn Ser Ser Arg Ser Tyr 
20 25 30 

Phe Thr Gly Arg lie Glu Lys Val Ser Leu Asn Leu Gly Val Pro Pro 

35 40 45 

Val Ser Thr Glu Val Trp Gly Met Thr His Asp Ala Asn Gly Leu Pro 
15 50 55 60 

Phe Glu lie Pro lie Ser Phe Ser Arg Phe Asn Ser Gin Gly Asp He 
65 70 75 80 

Ala Thr Thr Tyr Tyr He Ala Asn Ser Glu Ala Thr Leu Asn Glu Trp 
85 90 95 

20 Cys Asp Tyr Ala His Pro Gly Gly He Val Arg Val Glu Gly Arg Phe 
100 105 110 

Trp Lys Met Thr Tyr Asn He Pro Thr Tyr Asn Ala Val Cys Thr Arg 

115 120 125 

He Thr Phe Glu Asn Gin Glu He Glu Gly Thr He Val Leu He Pro 
25 130 135 140 

Lys Pro Lys Val Ser Leu Pro His Val Ser Glu Ser Val Pro Cys He 
145 150 155 160 

Arg Thr Glu Ala Gly Arg Glu Phe He Leu Cys Glu Glu Asp Asp Thr 
165 170 175 

30 Phe Val Ser His Asp Gly Asn Glu Val Thr He Gly Gly Lys Pro Phe 
180 185 190 

Leu Leu Asn Thr Asn Val Lys He Val Gly Asp Val Ser Gin Lys Tyr 

195 200 205 

Ala Val Gly Val Gly Glu He Arg Phe Leu Gin He Cys Ala Gin Thr 
35 210 215 220 

Val Ser Gin Gin Lys 
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225 

(2) INFORMATION FOR SEQ ID NO: 94 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 228 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : misc__f eature 

(B) LOCATION 1. . .228 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94 



Met Lys Lys Ala Phe Val Phe Val Leu Leu Val Cys Leu Phe Ser Ser 

1 5 10 - - 15 

25 Phe Ser Ser Ser Ala Gin Thr Thr Thr Asn Ser Ser Arg Ser Tyr Phe 

20 25 30 

Thr Gly Arg lie Glu Lys Val Ser Leu Asn Leu Gly Val Pro Pro Val 

35 40 45 

Ser Thr Glu Val Trp Gly Met Thr His Asp Ala Asn Gly Leu Pro Phe 
30 50 55 60 

Glu lie Pro lie Ser Phe Ser Arg Phe Asn Ser Gin Gly Asp He Ala 

65 70 75 . 80 

Thr Thr Tyr Tyr He Ala Asn Ser Glu Ala Thr Leu Asn Glu Trp Cys 

85 90 95 

35 Asp Tyr Ala His Pro Gly Gly He Val Arg Val Glu Gly Arg Phe Trp 

100 105 110 
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Lys Met Thr Tyr Asn He Pro Thr Tyr Asn Ala Val Cys Thr Arg He 

115 120 125 

Thr Phe Glu Asn Gin Glu He Glu Gly Thr He Val Leu He Pro Lys 
130 135 140 

5 Pro Lys Val Ser Leu Pro His Val Ser Glu Ser Val Pro Cys He Arg 
145 150 155 160 

Thr Glu Ala Gly Arg Glu Phe He Leu Cys Glu Glu Asp Asp Thr Phe 

165 170 175 

Val Ser His Asp Gly Asn Glu Val Thr He Gly Gly Lys Pro Phe Leu 
10 180 185 190 

Leu Asn Thr Asn Val Lys He Val Gly Asp Val Ser Gin Lys Tyr Ala 

195 200 205 

Val Gly Val Gly Glu He Arg Phe Leu Gin He Cys Ala Gin Thr Val 
210 215 220 

15 Ser Gin Gin Lys 
225 

(2) INFORMATION FOR SEQ ID NO: 95 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 547 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1. . .547 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95 
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Met Lys Thr lie Val Arg Tyr Ser Arg Leu Pro Val Ala Leu Phe Phe 
15 10 15 

Cys Leu Leu Gly Ala Val His Leu Ser Val Glu Ala Gin Met Leu Asn 
5 20 25 30 

Thr Pro Phe Glu Leu Ser Asp Gin lie Val Leu Ser Pro Thr Glu Arg 

35 40 45 

Gin Tyr Arg Glu lie Cys Val Gin Thr Lys Glu Lys Arg Gly Ala Asp 
50 55 60 

10 Leu Phe Pro Leu Ser Asp Lys Leu Arg Asp Ser Ala Tyr Val Arg Phe 
65 70 75 80 

Gly Ser Ala Tyr Gly Asp lie Ala Gly Asp Tyr Leu Pro Tyr Asn Gly 

85 90 95 

Asn Asn Tyr Ser Ser Leu Ser Leu Glu Ser Gly Gly Arg lie Ser Val 
15 100 105 110 

Arg Asn Tyr Gly Thr Leu Gin Gly Ser Ala Ser Tyr Ser Arg Gly Met 

115 120 125 

His Lys Arg lie Gly Trp Asn Ala Leu Arg Asn Ala Glu Ala Tyr Tyr 
130 135 140 

20 Pro Tyr Leu Val Ser Asp Ser Thr Gly Gly Asp Tyr His Phe Glu Asp 
145 150 155 160 

Tyr Arg Leu Ala Gly Tyr Tyr Ser Phe Arg Ala Gly Arg Leu Pro Leu 

165 170 175 

Gly lie Gly Phe Ser Tyr Arg Gly Glu Val Ala Tyr Arg Leu Thr Asp 
25 180 185 190 

Pro Arg Thr Thr Asn Thr Thr Gly Ala Leu Glu Leu Ser Cys Ala Thr 

195 200 205 

Ser Leu Thr Leu Pro Arg Glu Asn Arg Leu Ser Leu Ser Ala Ala Tyr 
210 215 220 

30 Leu Tyr His Arg Gin His Leu Thr Gin Tyr Asn Trp Arg Pro Gly Gin 
225 230 235 240 

Gin Asp Lys Phe Phe Val Ser Tyr Gly Phe Gly Gin Val Asp Val Ser 

245 250 255 

Asn Ser Pro lie Trp Phe Gly lie Ser Arg Met Asn Tyr Val Asn Gly 
35 260 265 270 

Trp Lys Leu Ser Ser Arg Leu Asp Thr Arg Arg Gly Asp Ala lie Gly 
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275 280 285 

Leu Asp Tyr Ser Gly Tyr Phe Leu Asp Thr Glu Glu Arg Ser Ser lie 

290 295 300 

Asn Leu Phe Ala Leu Leu Tyr Asn Arg Leu Arg Leu Tyr Gly Ser Trp 
5 305 310 315 320 

His Leu Ser Asp Phe Asp Phe Ser Phe Ser Ala Asp Tyr Ala Leu Arg 

325 330 335 

Gin Gly lie Glu Arg lie Tyr Glu Asp Tyr Lys Pro Asp Asp Asn Tyr 
340 345 350 

10 His lie Tyr Asp Leu Arg lie Leu Ala lie Arg Arg Trp Tyr Met Leu 
355 360 365 

Asn Glu Phe Ser Ala Gin Ala Gin Ala Ser Tyr Arg lie Arg Thr Asp 

370 375 380 

Arg Gly Cys Ala Leu Arg Val Ser Ala Gly Ser Asp Phe Tyr Gly Tyr 
15 385 390 395 400 

Asp Glu Thr Tyr Arg Lys His Gly His His Thr Met Ser Gly Met Leu 

405 410 415 

Arg Pro Phe Ala Gly lie Ala Tyr Asp His Ala Gly Ser Lys Leu Asp 
420 425 430 

20 Phe Gly Leu Ser Leu Ser Ala Ala Tyr Arg Met Val Leu Thr His Ser 
435 440 445 

Tyr Lys lie Arg Thr lie Gin Lys Glu Gin Leu Asp Tyr Gin Leu Ala 

450 455 460 

Tyr Leu Pro Tyr Ala Tyr Arg Asn Arg Glu Gly Val Glu Val Arg Ser 
25 465 470 475 480 

Ser Leu Tyr Val Ser lie Pro Met Gin Asn Thr His Arg Leu Met Thr 

485 490 495 

Glu Leu Arg Leu Tyr Gly Asp Leu Met Lys Arg Lys Asp Gly lie Ala 
500 505 510 

30 Tyr Gly Lys Thr Pro Gly Val lie Ser His lie Leu Ser Asp Pro Gin 
515 520 525 

Ala Glu Arg Thr Ser Gly His Thr lie Gly Ala lie Cys Asn lie Ser 

530 535 540 

Tyr Leu Phe 
35 545 



4 
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(2) INFORMATION FOR SEQ ID NO: 96 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

10 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

15 (ix) FEATURE: 

(A) NAME/KEY: mis c_f eature 

(B) LOCATION 1. . .750 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96 



Met Lys Lys Leu His Met lie Ala Ala Leu Ala Val Leu Pro Phe Cys 
15 10 15 

Leu Thr Ala Gin Ala Pro Val Ser Asn Ser Glu lie Asp Ser Leu Ser 

20 25 3-0 

25 Asn Val Gin Leu Gin Thr Val Gin Val Val Ala Thr Arg Ala Thr Ala 
35 40 45 

Lys Thr Pro Val Ala Tyr Thr Asn Val Arg Lys Ala Glu Leu Ser Lys 

50 55 60 

Ser Asn Tyr Gly Arg Asp lie Pro Tyr Leu Leu Met Leu Thr Pro Ser 
30 65 70 75 80 

Val Val Ala Thr Ser Asp Ala Gly Thr Gly lie Gly Tyr Ser Gly Phe 

85 90 95 

Arg Val Arg Gly Thr Asp Ala Asn Arg lie Asn lie Thr Thr Asn Gly 

100 105 110 

35 Val Pro Leu Asn Asp Ser Glu Ser Gin Ser Val Phe Trp Val Asn Met 
115 120 125 
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10 



Pro Asp Phe Ala Ser Ser lie Glu Asp Leu Gin Val Gin Arg Gly Val 

130 135 140 

Gly Thr Ser Thr Asn Gly Ala Gly Ala Phe Gly Ala Ser Val Asn Met 
145 150 155 160 

5 Arg Thr Asp Asn Leu Gly Leu Ala Pro Tyr Gly Arg Val Asp Leu Ser 

165 170 175 

Gly Gly Ser Phe Gly Thr Phe Arg Arg Ser Val Lys Leu Gly Ser Gly 

180 185 190 

Arg lie Gly Arg His Trp Ala Val Asp Ala Arg Leu Ser Lys lie Gly 

195 200 205 

Ser Asp Gly Tyr Val Asp Arg Gly Ser Val Asp Leu Lys Ser Tyr Phe 

210 215 220 

Ala Gin Val Gly Tyr Phe Gly Ser Asn Thr Ala Leu Arg Phe lie Thr 
225 230 235 240 

15 Phe Gly Gly Lys Glu Val Thr Gly lie Ala Trp Asn Gly Leu Ser Lys 

245 250 255 

Glu Asp Glu Ala Lys Tyr Gly Arg Arg Tyr Asn Ser Ala Gly Leu Met 

260 265 270 

Tyr Val Asp Ala Gin Gly Val Pro His Tyr Tyr His Asn Thr Asp Asn 
20 275 280 285 

Tyr Glu Gin Arg His Tyr His Ala lie Met Thr His Ser Phe Ser Pro 

290 295 300 

Ser Val He Leu Asn Leu Thr Ala His Tyr Thr Ala Gly Tyr Gly Tyr 
305 310 315 ' ' 320 

25 Thr Asp Glu Tyr Arg Thr Gly Arg Lys Leu Lys Glu Tyr Ala Leu Gin 

325 330 335 

Pro Tyr Val Glu Asn Ser Val Thr Val Lys Lys Thr Asp Leu He Arg 

340 345 350 

Gin Lys Tyr Leu Asp Asn Asp Phe Gly Gly Leu He Gly Ser Leu Asn 
30 355 360 365 

Trp His Thr Gly Ala Trp Asp Leu Gin Phe Gly Ala Ser Gly Asn He 

370 375 380 

Tyr Lys Gly Asp His Phe Gly Arg He Thr Tyr He Lys Lys Tyr Asn 
385 390 395 400 

35 Gin Pro Leu Ala Pro Asp Phe Glu Tyr Tyr Arg Asn Arg Ala Asp Lys 

405 410 415 
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Arg Glu Gly 

Leu Asn Met 
435 

5 Asn Gly lie 
450 
lie Asp Leu 
465 

Thr Tyr Ser 

10 

Ala His Arg 

Gin Tyr Pro 
515 

15 Tyr Ala Ser 
530 

Tyr Lys Asp 
545 

Met Leu Thr 

20 

Thr Leu Gly 

Phe Thr Met 
595 

25 Val Tyr Asp 
610 

Glu Ser Thr 
625 

Leu Thr Leu 

30 

Val Ser Lys 

Ser Ser Tyr 
675 

35 His Phe Val 
690 



Ala Ala Phe Ala 
420 

Tyr Ala Asp Leu 

Thr Asp Glu Tyr 
455 

Asp Lys Thr Phe 
470 

Phe Asp Asp Ala 
485 

Glu Pro Asn Arg 
500 

Thr Pro Glu Arg 

Pro Leu Leu Ser 
535 

Gin Leu Val Leu 
550 

Ser Asn Val Pro 
565 

Trp Gin lie Leu 
580 

Ser Arg Asn Lys 

Ala Asp Tyr Asn 
615 

Asp lie Ala Tyr 
630 

Ser His Ala Gly 
645 

Gin Tyr Leu Asp 
660 

Trp Val Asn Asp 

Lys Arg Val Ala 
695 



Lys Ala Asn Trp 
425 

Gin Tyr Arg Thr 
440 

Asp Glu Val Gin 

Arg Phe Leu Asn 
475 

His Thr Ala Tyr 
490 

Thr Asn Tyr Thr 
505 

Leu lie Asp Tyr 
520 

Ala Gly Val Gly 

Asp Gly Arg Leu 

555 

Asp Ser Tyr Arg 
570 

Pro Arg Leu Leu 
585 

lie Asp Arg Tyr 
600 

Trp Leu Glu Leu 

Ser Pro Asn Val 
635 

Phe Glu Met Ala 
650 

Asn Thr Gin Arg 
665 

Leu Arg Leu Gly 
680 

Leu Gly Val Gin 



Gin lie Thr Pro Glu 
430 

lie Gly Tyr Thr lie 
445 

Gly Ser Met Gin His 
460 

Pro Lys Ala Gly Leu 
480 

Ala Ser Val Ala Val 
495 

Glu Ala Gly lie Gly 
510 

Glu Leu Gly Tyr Arg 
525 

Leu Tyr Tyr Met Gin 
540 

Ser Asp Val Gly Gin 

560 

Met Gly Leu Glu Leu 
575 

Arg Trp Asp Ala Ser 
590 

Val Gin Tyr Thr Ser 

605 - 
Lys Glu Glu Thr Leu 
620 

lie Ala Gly Ser Met 
640 

Trp Thr Ser Arg Phe 
655 

Ser Asp Arg Met Leu 
670 

Tyr Val Leu Pro Val 
685 

Leu Asn Asn Leu Phe 
700 
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Asn Leu Met Tyr Ala Ser Asn Ala Tyr lie Tyr Asp Ala Gly Tyr Val 
705 710 715 720 

Gin Ala Ser Gly Glu Leu Ser Ala Tyr Ala Asp Leu Arg Tyr Tyr Pro 

725 730 735 

Gin Ala Gly Phe Asn Ala Leu Gly Unk Leu Thr lie Asp Phe 
740 745 750 

(2) INFORMATION FOR SEQ ID NO: 97 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 85 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...285 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97 

Met Val Val Thr Leu Leu Val lie Val Gly lie Val Val Val Val Arg 

15 10 15 

Tyr Ser Leu Arg Val His Val His Lys Thr Gly Thr Val Val Ser Ala 

20 25 30 

Ala lie Phe Gly Phe lie Leu Leu Gly Lys Thr Val Pro Cys Asp Thr 

35 40 45 

Arg Asn Phe Phe Ser Ser Glu Ser Asp Glu Pro Glu Ser Arg Val Ala 

50 55 60 

Thr Glu lie Ala His Leu Cys Glu lie Gly Phe Gin lie His Ala Ser 
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65 70 75 80 

Ser lie His Val Ala Val Arg Thr Asp Phe Gly Gin Ala Gly lie His 

85 90 95 

Cys Pro Met Ala Thr Asp Ala Ser Ala Thr Glu Phe Asp Arg Ser Ala 
5 100 105 HO 

Glu Cys Ala Glu Arg Thr Ser Ala Gin lie Asp Thr Ala lie Arg Ser 

115 120 125 

Gin Ser Gin He He Arg Thr His He Asp Thr Cys Pro Lys Ser Ser 
130 135 140 

10 Gly Thr He Gly Gly Ser Thr His Thr Ser Leu His Leu Lys Val Phe 
145 150 155 160 

Asp Gly Arg Gly Glu Val Gly His He His Pro Lys Asp Gly Leu Arg 

165 170 175 

Phe Gly Val Val Glu Gly Tyr Ser He Gly Ser Tyr Val Asp Ala He 
15 180 185 190 

Gly He Gly Ala Thr His Ala Lys Ala Gly He Ser Asp Thr Arg Thr 

195 200 205 

Gly He Ala Gly Gly Tyr His Gly Gly Ser Gin His Gin Gin He Gly 
210 215 220 

20 Asp Val Thr Thr He He Gly Leu Gly Lys Phe Gly Leu Ala Asn Val 
225 230 235 240 

Gly Val Ser Asp Arg Gly Phe Arg Arg Gly Ala Ser Ser Tyr Asp Leu 

245 250 255 

Tyr Gly Leu Glu Leu His He Ala Lys Thr He Tyr Leu Ala Val Gly 
25 260 265 270 

Asp Gly Cys Leu Cys Arg Gin Ala Glu Arg Gin Asp Gly 
275 280 285 



(2) INFORMATION FOR SEQ ID NO: 98 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 240 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

35 

(ii) MOLECULE TYPE: protein 
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(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1. . .240 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98 

Met Lys Lys Ala lie Leu Ser Gly Ala Ala Leu Leu Leu Gly Leu Cys 
15 10 15 

15 Ala Asn Ala Gin Asn Val Gin Leu His Tyr Asp Phe Gly His Ser lie 
20 25 30 

Tyr Asp Glu Leu Asp Gly Arg Pro Lys Leu Thr Thr Thr Val Glu Asn 

35 40 45 

Phe Thr Pro Asp Lys Trp Gly Ser Thr Phe Phe Phe lie Asp Met Asp 

20 50 55 60 

Tyr Thr Gly Lys Gly lie Gin Ser Ala Tyr Trp Glu lie Ser Arg Glu 
65 70 75 80 

Leu Lys Phe Trp Gin Ala Pro Val Ser lie His Leu Glu Tyr Asn Gly 
85 90 • - 95 

25 Gly Leu SerVThr Ser Phe Thr Phe Gly His Asp Ala Leu lie Gly Ala 
100 105 110 

Thr Tyr Thr Tyr Asn Asn Pro Ser Phe Thr Arg Gly Phe Thr lie Thr 

115 120 125 

Pro Met Tyr Lys His Leu Gly Ala His Asp Phe His Thr Tyr Gin lie 

30 130 135 140 

Thr Gly Thr Trp Tyr Met His Phe Leu Asp Gly Leu Leu Thr Phe Asn 
145 150 155 160 

Gly Phe Leu Asp Leu Trp Gly Phe Pro Gin Glu Asn Pro lie Gly Gly 
165 170 175 

35 Pro Val Leu Lys Glu Gly Asp Lys Phe Val Phe Leu Ser Glu Pro Gin 
180 185 190 
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Phe Trp lie Asn Leu Asn Arg lie Lys Gly lie Asp Lys Asp Phe Asn 

195 200 205 

Leu Ser lie Gly Thr Glu Met Glu lie Ser Arg Asn Phe Ala Arg Met 

210 215 220 

Asp Lys Phe Ser Cys lie Pro Thr Leu Ala Val Lys Trp Thr Phe Asn 
225 230 235 240 



(2) INFORMATION FOR SEQ ID NO: 99 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 399 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME /KEY: misc_feature 

(B) LOCATION 1. . . 399 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99 

Met Arg Leu Ser Ala lie Leu lie Ala Leu lie Val Met Leu Pro Ala 

15 10 15 

Val Leu Ser Gly Gin His Tyr Tyr Ser Met Ala Gly Glu Arg Leu Glu 

20 25 30 

Thr Asp Ser lie Arg Pro Asn Glu Leu Ser Ala Ser lie Arg Ser Ala 

35 40 45 

Leu Phe Phe Arg Asn Asn Glu Tyr Asn Ala Arg Ser Val Lys Gly Tyr 

50 55 60 

Thr Leu Pro Gly Ala Arg Val Ser Ala Phe Ala Ser Tyr Ser Leu Pro 
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65 70 75 80 

Ala Ala His Gly Val Lys Leu Ser Leu Gly Val Ser Thr Leu Asn Tyr 

85 90 95 

Trp Gly Ala Ser Arg Tyr Pro Ala Gly lie Ala Tyr Ser Asp Leu Pro 
5 100 105 110 

Tyr Trp Thr Asp Tyr Asn Asp Tyr Val Arg Leu Arg lie Leu Pro Tyr 

115 120 125 

Val Gin Ala Met Leu Lys Pro Thr Ala Thr Thr Ala Leu Met Leu Gly 
130 135 140 

10 Asn lie Ala Gly Gly Thr Ala His Gly Leu lie Glu Pro lie Tyr Asn 
145 150 155 160 

Pro Glu Leu Asp Leu Thr Ala Asp Pro Glu Ala Gly Val Gin Phe Arg 

165 170 175 

Gly Asp Trp Thr Arg Phe Arg Met Asp Val Trp Val Asn Trp Met Ser 
15 180 185 190 

Met lie Phe Lys Asn Asp Asn His Gin Glu Ser Phe Val Phe Gly Leu 

195 200 205 

Ser Thr Thr Ser Lys Leu Leu Ser Gly Glu Gly Lys Trp Arg Leu Glu 
210 215 220 

20 Leu Pro Leu Gin Ala lie Ala Thr His Arg Gly Gly Glu Tyr Asn Trp 
225 230 235 240 

Ala Gin Gin Asp Thr Val His Thr Trp Val Asn Gly Ala Val Gly Leu 

245 250 255 

Lys Leu Ser Tyr Arg Pro Arg Thr Asp Lys Pro Met Gin Xle Trp Gly 
25 260 265 270 

Ser Ala Tyr Gly Val Ala Ala Leu Ser Ser Gly Gly Tyr Phe Pro Tyr 

275 280 285 

Glu Arg Gly Trp Gly Gly Tyr Leu Ser Leu Gly Met Asp Leu Glu His 
290 295 300 

30 Phe Ala Phe Arg Thr Asp Tyr Trp Tyr Gly Arg His Tyr Val Ser Pro 
305 310 315 320 

Phe Ala Ala Pro Phe Ala Asn Ser Leu Thr Tyr Asp Lys Gin Pro Leu 

325 330 335 

Thr Asn Gly Trp Gly Asp Tyr lie Arg Leu Tyr Ala Asp Tyr Ser Trp 
35 340 345 350 

Arg Met Ala Arg Ser Val Ser Leu Ala Ala Val Ala Arg Val Trp Phe 
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355 360 365 

Gin Pro Ser Asp Arg Phe Ala Met Ser His Ala Leu Glu Leu Thr Met 

370 375 380 

Arg lie Asp Pro Lys Phe Pro lie Ala Phe Leu Lys Gly Asn His 
385 390 395 



(2) INFORMATION FOR SEQ ID NO: 100 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 382 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



(ix) FEATURE: 

(A) NAME/ KEY: mis cofeature 

(B) LOCATION 1. . .382 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100 



Met Asn Lys Ser Leu Leu Ser Leu Ala Cys Leu lie Leu Cys Gly Met 

15 10 15 

Pro Ala lie Ala Gin Gin Thr Gly Pro Ala Glu Arg Ser Gly Glu Pro 

20 25 30 

Ser Leu Ala Glu Arg Val Phe Gly Leu Glu Gin Lys Gin Lys Lys Leu 

35 40 45 

Lys Val Tyr Leu Gly lie Gin Ser Phe Tyr Asp Gin Pro Leu Val Asp 

50 55 60 

Asp Glu Ser His lie Gly His Phe Lys Val Gin Glu Leu Arg Met Ser 
65 70 75 80 
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Ala His Gly Glu Leu Asn Arg His Leu Ser Phe Asp Trp Arg Gin Arg 

85 90 95 

Leu Asn Arg Ala Ala Asp Gly Thr Ser Phe Ala Asp Asn Leu Ser Asn 
100 105 110 

5 Ala lie Asp lie Ala Gly Val Asp Trp His Pro Asn Asp Lys Val Ser 
115 120 125 

Phe Phe Phe Gly Arg Gin Tyr Ala Arg Phe Gly Gly lie Glu Tyr Asp 

130 135 140 

Met Asn Pro Val Glu lie Tyr Gin Tyr Ser Asp Leu Val Asp Tyr Met 
10 145 150 155 160 

Thr Cys Tyr Thr Ser Gly Val Asn Phe Ala Trp Asn Phe His Pro Glu 

165 170 175 

Gin Gin Leu Gin Leu Gin Val Leu Asn Ala Tyr Asn Asn Arg Phe Ala 
180 185 190 

15 Asp Arg Tyr His Val Thr Pro Asp Val Ala Thr Ala Thr Ser Tyr Pro 
195 200 205 

Leu Leu Tyr Ser Ala Gin Trp Asn Gly Thr Leu Leu Gly Gly Ala Leu 

210 215 220 

His Met Arg Tyr Ala Val Ser Met Ala His Gin Ala Gin Glu Arg Asn 
20 225 230 235 240 

Met Trp Tyr Phe Thr Ala Gly Asn Leu Phe Asn Pro Gly Lys Arg lie 

245 250 255 

Asn Gly Tyr Leu Asp Leu Thr Tyr Ser lie Glu Gly Leu Asp Asp Lys 
260 265 * 270 

25 Gly He Met Thr Ala Arg Tyr Gly Lys Gly Lys Thr Leu Thr Asp Val 
275 280 285 

Lys Tyr Tyr Ala Leu Val Ser Lys Trp Asn Phe Arg He Phe Asp Gin 

290 295 300 

Val Asn Leu Phe Leu Lys Gly Met Tyr Glu Asn Gly Tyr Ala Pro Ala 
30 305 310 315 320 

Gin Tyr Gly Glu Ser Ser His Thr Arg His Ser Tyr Gly Tyr Met Gly 

325 330 335 

Gly Val Glu Tyr Tyr Pro Thr Glu Thr Asn Phe Arg Leu Phe Val Thr 
340 345 350 

35 Tyr He Gly Arg His Tyr Arg Tyr Ser Ala Thr Glu Thr Glu Ser Thr 
355 360 365 



5 



10 
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Asn Ala Leu Arg Ala Gly Leu lie Tyr Gin lie Pro Phe Leu 
370 375 380 

(2) INFORMATION FOR SEQ ID NO: 101 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 494 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

( iii ) HYPOTHETICAL : YES 



15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME /KEY: misc_feature 
20 (B) LOCATION 1...494 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101 

Met Lys Arg Arg Phe Leu Ser Leu Leu Leu Leu Tyr lie Leu Ser Ser 
25 1 5 10 15 

lie Ser Leu Ser Ala Gin Arg Phe Pro Met Val Gin Gly lie Glu Leu 

20 25 30 

Asp Thr Asp Ser Leu Phe Ser Leu Pro Lys Arg Pro Trp Arg Ala lie 

35 40 45 

30 Gly Lys Thr lie Gly Val Asn Leu Ala Val Trp Gly Phe Asp His Phe 
50 55 60 

lie Met Asn Glu Asp Phe Ala Asp lie Ser Trp Gin Thr lie Lys Ser 
65 70 75 80 

Asn Phe Gin Thr Gly Phe Gly Trp Asp Asn Asp Lys Phe Val Thr Asn 
35 85 90 95 

Leu Phe Ala His Pro Tyr His Gly Ser Leu Tyr Phe Asn Ala Ala Arg 
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100 105 HO 

Ser Asn Gly Leu Ser Phe Arg His Ser Ala Pro Phe Ala Phe Phe Gly 

115 120 125 

Ser Leu Met Trp Glu Leu Leu Met Glu Asn Glu Pro Pro Ser He Asn 

130 135 140 

Asp Leu Cys Ala Thr Thr He Gly Gly He Ala Leu Gly Glu Met Gly 
145 150 155 160 

His Arg Leu Ser Asp Leu Leu lie Asp Asn Arg Thr Thr Gly Trp Glu 

165 170 175 

Arg Met Gly Arg Glu Val Ala He Ala Leu He Asn Pro Met Arg Phe 

180 185 190 

Leu Asn Arg Leu Thr Ala Gly Glu Val Thr Ser Val Gly Ser Arg Ser 

195 200 205 

Gly Gin He Phe Gin Ser Val Pro He Asn He Val Val Asp Ala Gly 

210 215 220 

Phe Arg Phe Leu Ala Asp Lys Arg His Ala Arg Thr Gly Ala Thr Ala 
225 230 235 240 

Leu Thr Leu Asn Leu Arg Phe Asp Tyr Gly Asp Pro Phe Arg Ser Glu 

245 250 255 

Thr Phe Ser Pro Tyr Asp Phe Phe Gin Phe Lys Ala Gly Leu Ser Phe 

260 265 270 

Ser Glu Ser Gin Pro Leu Leu Ser Gin He Asn Leu He Gly He Leu 

275 280 285 

Ser Gly Cys Gin Leu Leu Ala His Glu Arg Thr Val Leu Val Gly Gly 

290 295 300 

Leu Phe Gin His Phe Asp Tyr Tyr Asn Ser Glu Lys Arg He Ser Lys 
305 310 315 320 

Asn Ser Glu Glu Val Leu Val Thr Pro Tyr Arg He Ser Gin Val Ala 

325 330 335 

Ala Leu Gly Gly Gly Leu He Phe Gin His His Gly Lys Phe Arg Arg 

340 345 350 

Arg Pro Leu Glu Leu Tyr Ala Glu Thr Tyr Leu Asn Val Val Pro Met 

355 360 365 

Gly Ala Ser Leu Ser Asp His Tyr Asn Val Asp Asn Arg Asp Tyr Asn 

370 375 380 

Leu Gly Ser Gly Leu Ser Gly Lys Leu Tyr Leu Gly Ala Thr Tyr Asn 
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385 390 395 400 

Asp Leu Trp Ser Trp Leu Leu Gly Val Glu Ser Tyr Arg Leu Tyr Thr 

405 410 415 

Trp lie Gly Tyr Glu Glu Pro His Gin Lys Asn Thr Asp Val Ser Ser 

420 425 430 

Phe Met Val Gin Gly Asp Glu Ser Lys Ala Arg Leu Leu Val Thr Ser 

435 440 445 

Ser Glu Phe Ala Phe His Pro Gly Pro Trp His Val Ala lie Val Ala 

450 455 460 

Arg Arg Phe lie Arg Lys Thr Ala Tyr Gin Phe Tyr Pro Asn Val Ser 
465 470 475 480 

Phe Asp Thr Gly Asp lie Gin Leu Arg Val Gly Phe His Phe 
485 490 

(2) INFORMATION FOR SEQ ID NO: 102 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 294 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .294 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102 



Met Lys Arg Leu He Val Phe Leu Ala Met Gly Gly Leu Leu Phe Thr 
1 5 io 15 
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Leu Ala Asn Ala Gin Glu Ala Asn Thr Ala Ser Asp Thr Pro Lys Lys 

20 25 30 

Asp Trp Thr He Lys Gly Val Thr Gly Leu Asn Ala Ser Gin Thr Ser 

35 40 45 

Leu Thr Asn Trp Ala Ala Gly Gly Glu Asn Thr Val Ala Gly Asn Leu 

50 55 60 

Tyr Leu Asn He Asp Ala Asn Tyr Leu Lys Asp Lys Trp Ser Trp Asp 
65 70 75 80 

Asn Gly Leu Arg Thr Asp Phe Gly Leu Thr Tyr Thr Thr Ala Asn Lys 

85 90 95 

Trp Asn Lys Ser Val Asp Lys He Glu Leu Phe Thr Lys Ala Gly Tyr 

100 105 HO 

Glu He Gly Lys His Trp Tyr Gly Ser Ala Leu Phe Thr Phe Leu Ser 

115 120 125 

Gin Tyr Ala Lys Gly Tyr Glu Lys Pro Ser Asp His Leu Thr Gly Val 

130 135 140 

Lys His He Ser Asn Phe Phe Ala Pro Ala Tyr Leu Thr Leu Gly He 
145 150 155 160 

Gly Ala Asp Tyr Lys Pro Asn Glu Lys Phe Ser Leu Tyr Leu Ser Pro 

165 170 175 

Thr Thr Gly Lys Leu Thr Val Val Ala Asp Asp Tyr Leu Ser Ser Leu 

180 185 190 

Gly Ala Phe Gly Val Lys Val Gly Glu Lys Thr Met Phe Glu Leu Gly 

195 200 205 

Ala Leu Val Val Gly Ser Ala Asn He Asn Leu Met Glu Asn Val Asn 

210 215 220 

Leu He Thr Lys Ala Ser Phe Phe Ser Ala Tyr Thr His Asp Phe Gly 
225 230 235 240 

Asn He Asp He Asn Trp Glu Ala Met Leu Ala Met Lys He Asn Lys 

245 250 255 

Phe Leu Thr Ala Thr He Ala Thr Asn Leu He Tyr Asp Asp Asp Val 

260 265 270 

Lys He Asn Asp Gly Pro Lys He Gin Phe Lys Glu Val Val Gly Val 

275 280 285 

Gly Val Ala Tyr Thr Phe 
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(2) INFORMATION FOR SEQ ID NO: 103 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 961 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

( iii ) HYPOTHETICAL : YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1. . . 961 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103 

Met Lys Lys Leu Phe Pro Leu Leu Leu Leu lie Leu Ser lie Leu Val 

15 10 15 

Gly Cys Gly Lys Lys Glu Lys His Ser Val Thr Glu lie AJra Unk Glu 

20 25 30 

Lys Lys Arg lie Thr Ala Leu Leu Tyr. Glu Lys Glu Leu Pro Thr Asp 

35 40 45 

Ser Val Lys Gin Leu Tyr Glu Asn Ser Val Gin Asn Lys Asn Leu Val 

50 55 60 

Gly Gin Met Leu Phe Ala lie Glu Val Gly Lys Arg Met Arg Asn Met 
65 70 75 80 

Ser Gin Tyr Thr Asp Ala Met Leu Tyr His Gin Lys Gly Leu Asn Ala 

85 90 95 

Ala Leu Asn Leu Arg Asp Thr lie Val Ala Ala Gin Ala Trp Asn His 

100 105 110 

Leu Gly Thr Asp Ser Arg Arg lie Gly Ala Leu Ala Glu Ala Ser Asp 
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115 120 125 

Tyr His Tyr Lys Ala Leu Ser Leu lie Glu Ser Phe Ser Gly Asn Gin 

130 135 140 

Asn Arg Pro Ala lie Lys Ala Arg Ser Ala Ala Leu Asn Gly lie Gly 
5 145 150 155 160 

Asn lie Asn Leu Glu Leu Gly Tyr His Asp Glu Ala Glu Lys Asn Phe 

165 170 175 

Leu Lys Ala Leu Gin Gly Glu Lys Glu Leu Asp Ser Pro Leu Gly Gin 
180 185 190 

10 Ala lie Asn Tyr Ala Asn Leu Gly Arg lie Tyr Arg Gin Arg Lys Glu 
195 200 205 

Tyr Asp Lys Ala Arg Thr Tyr Phe Leu Leu Ser Leu Glu Gin Asn Asn 

210 215 220 

Met Ala Glu Asn Leu Met Gly lie Gly Leu Cys Ser lie Asn Leu Gly 
15 225 230 235 240 

Glu Val Asp Glu Glu Lys Gly Asp Tyr Gin Lys Ala Leu Gin Glu Tyr 

245 250 255 

Ala Thr Ala Tyr Lys Leu Met Glu Gin Leu Ser Asp Arg Trp His Trp 
260 265 270 

20 Leu Asn Ser Cys lie Pro Met Ala Arg lie Asn Leu Lys Gin Gly Asn 
275 280 285 

Glu Arg Leu Tyr Gin His Phe lie Ser Leu Ala Glu Gly Thr Ala Lys 

290 295 300 

Glu lie Asn Ser Thr Ser His Leu lie Glu lie Tyr Asn Leu Gin Tyr 
25 305 310 • 315 320 

Glu Asn Leu Glu Arg Lys Lys Glu Tyr Lys Gin Ala Leu Glu Ala Phe 

325 330 335 

Cys Leu Ser Lys Thr Leu Ser Asp Ser Met Ser lie Ala His Lys Val 
340 345 350 

30 Ser Ser lie Gin Glu Thr Arg Phe Asn Tyr Glu Arg Asn Lys Ser Gin 
355 360 365 

Lys Glu Leu Glu Glu lie Gin Gin Val Ser Lys Ala Lys Gin Glu Lys 

370 375 380 

Ser Lys Phe lie Leu Leu Ser Thr Leu Phe Ala Leu Phe lie Ser lie 
35 385 390 395 400 

Leu Leu lie Ser Val Leu Thr Tyr Ala Tyr Arg Gin Gly Lys Lys His 
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405 410 415 

Asn Lys Leu lie Lys Glu Thr Asp Lys Leu Arg Ser Gly Phe Phe Thr 

420 425 430 

Gly lie Thr His Glu Phe Arg Thr Pro He Thr Val He Gin Gly Leu 

435 440 445 

Asn Glu Lys Met Ser Ser Ser Pro Asp Leu Gin Ala Ser Asp Arg Thr 

450 455 460 

Glu Leu His Lys He He Asp Arg Gin Ser Ser His Met Leu Asn Leu 
465 470 475 480 

Val Asn Gin Leu Leu Asp He Cys Lys He Arg Ser Gly Val Ser Thr 

485 490 495 

Pro Glu Trp Arg Asn Gly Asp He Val Ser Phe Val Gin He Leu He 

500 505 510 

Asp Ser Phe Ala Pro Tyr Ala Gin Ala Gin Asp He Thr Leu Glu Leu 

515 520 525 

Gin Pro Glu Ser Lys Pro He Val Val Asp Phe Val Pro Ser Tyr Leu 

530 535 540 

Gin Lys He He Ser Asn Leu Leu Ser Asn Ala He Lys Tyr Ser Leu 
545 550 555 560 

Ala Gly Gly Arg Val Val He Ser Leu Ala Lys Thr Lys Asn Glu Lys 

565 570 575 

Asn Leu He He Arg Val Ala Asp Asn Gly He Gly He Asp Lys Thr 

580 585 590 

Asp Gin Ala His He Phe Asp He Phe Tyr Arg Gly Gin Ser Ala Thr 

595 600 605 

Glu Lys His Gly Ser Gly Val Gly Leu Ser Phe Thr Asn He Leu Val 

610 615 620 

Glu Asn Leu Arg Gly Thr He Lys Val Glu Ser Gin Pro Gly Lys Gly 
625 630 635 640 

Ser Ala Phe Thr He Ser He Pro Thr Gin Asn Gin Ser Ser Ser Ala 

645 650 655 

Glu He Leu Pro Trp Leu Pro Ser Ser Asp Asp He Val Met Pro Val 

660 665 670 

His He Ala Pro Asp Asp Ser Pro Thr Ser Pro Met Val Ala Ala Leu 

675 680 685 

Asn His Arg Phe Glu Asp Glu Arg Pro Thr He Leu Leu Val Glu Asp 
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690 695 700 

Asn Lys Asp lie Asn Leu Leu Val Lys Leu Leu Leu Cys Asp Arg Tyr 
705 710 715 720 

Asn Val Leu Ser Ala Ala Asn Gly Lys Glu Gly lie Ala Leu Ala Thr 
5 725 730 735 

Glu His lie Pro Asp lie lie lie Thr Asp lie Met Met Pro lie Met 

740 745 750 

Asp Gly lie Glu Met Thr lie Arg Met Lys Gin Ser Pro Leu Leu Cys 
755 760 765 

10 His He Pro He Val Ala Leu Thr Ala Lys Ser Thr Glu Gin Asp Arg 
770 775 780 

Leu Glu Gly He Lys Ser Gly Val Val Ser Tyr Leu Cys Lys Pro Phe 
785 790 795 800 

Ser Pro Glu Glu Leu Leu Met Arg He Glu Gin Leu Leu Lys Asp Arg 
15 805 810 815 

Glu Leu Leu Lys Lys Phe Tyr Met Gin Lys Leu Met Leu Asp Arg Lys 

820 825 830 

Pro Glu Glu Glu Pro Gin Pro He Asp Asp Ser Ser Met Gin Phe Leu 
835 840 845 

20 Leu Ala Ala Lys Asp Ala Val Ser Gly Gly He Lys Gin Asn Pro Asp 
850 855 860 

Phe Ser Ala Gin Asp Leu Ala Glu Lys Met Cys Met Ser Pro Ser Gin 
865 870 875 880 

Leu Asn Arg Lys Leu Thr Ser Val Val Gly Cys Ser Thr He Gly Tyr 
25 885 890 895 

He Gin Gin He Lys He Lys Leu Ala Cys Lys Leu Leu Ala Asp Glu 

900 905 910 

Ser Lys Asn He Ser Asp He Ser He Glu Ala Gly Phe Ser Asp Pro 
915 920 925 

30 Ala Tyr Phe Ser Arg Thr Phe Lys Arg Tyr Met Asn Cys Ser Pro Ser 
930 935 940 

Gin Tyr Arg Gin Lys Leu Leu Ala Met Pro Gly Ser Asp Lys Glu Thr 
945 950 955 960 

Val 

35 



227 



(2) INFORMATION FOR SEQ ID NO: 104 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 222 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : mis cofeature 

(B) LOCATION 1. . .222 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104 

Met Lys His Leu Phe Lys Ser Thr Leu Val Leu Leu Cys Ala Leu Ser 

15 10 15 

Phe Ser Gly Thr Tyr Thr Phe Ala Gin Glu Asn Asn Thr Glu Lys Ser 

20 25 . 30^ 

Arg Phe Asp Phe Ser Val Arg Leu Gly Gin Gly Tyr lie Ala Gly Ser 

35 40 45 

Thr Thr Asn Leu Met Tyr Gly Tyr Thr Ser Ala Asn Asp Arg Leu Leu 

50 55 60 

Ser Gly Ala He Tyr Leu Gly Leu Thr Pro Ser Lys Lys Glu Asn Ala 
65 70 75 80 

Thr Gly Val Ala Phe Arg Phe Leu Ser Pro Ser Pro Gly Tyr Tyr Val 

85 90 95 

Asp He Ser Gly Lys Glu Asn Thr Leu Asn Tyr Ala Phe Tyr Val Val 

100 105 HO 

Gly Ala Tyr Asn Arg He Ala He Pro He Arg Pro He Lys Asn Phe 
115 120 125 
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Asn Phe lie Phe Ser Thr Glu Val Gly Met Ala Trp Met Ser Arg His 

130 135 140 

Glu Gin lie Tyr Asn Ser Thr Ser Gin Thr Trp Asp Lys Gin Arg Lys 
145 150 155 160 

Ser Arg Ser Gly Leu Asp Phe Gly Leu Gly Met His Leu Gin Unk His 

165 170 175 

lie Asn Lys Thr Val Tyr Phe Met Ala Gly Thr Asp Leu Thr Ser Cys 

180 185 190 

Met Phe Gly Lys Arg lie Asn Asp Tyr Gin Gin Lys Asp Arg Thr Phe 

195 200 205 

lie Ala Leu lie Asp Asn Ser lie Gly lie Gly Leu Asn Leu 
210 215 220 



(2) INFORMATION FOR SEQ ID NO: 105 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



(ix) FEATURE: 

(A) NAME/KEY: miscjeature 

(B) LOCATION 1...1158 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105 



Met Lys Arg lie Leu Pro lie Val Ala Phe Leu Ser Leu Phe Leu Ala 

15 10 15 

Leu Ala Leu Pro Ala Lys Ala Gin Arg Ala Met Gly Lys Thr Ala Asp 
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20 25 30 

Arg Ser Leu Met Ala Ser Gly His Trp Val Lys He Arg Val Asp Ala 

35 40 45 

Ser Gly Val Tyr Arg Leu Thr Asp Glu Gin Leu Arg Ala Asn Gly Phe 

50 55 60 

Ser Asp Pro Ser Lys Val Gly Val Phe Gly Tyr Gly Gly Gly Val Leu 
65 70 75 80 

Pro Glu Asp Leu Ser Arg He Thr Thr Asp Asp Leu Pro Pro Val Pro 

85 90 95 

Val Leu Arg Gin Gly Asn Ala Leu Tyr Phe Tyr Ala Val Gly Pro Val 

100 105 HO 

Thr Trp Phe Tyr Asn Pro Ala Lys Thr Thr Met Glu His Thr Val Asn 

115 120 125 

Thr Tyr Ser Thr His Gly Tyr Tyr Phe Leu Ser Asp Ala Ala Gly Ala 

130 135 140 

Pro Leu Gin Met Ser Gin Tyr Thr Gly Gly Gly Ala Ser Ala Glu Ala 
X45 150 155 160 

Leu He Asp Tyr Tyr Asp Glu Leu Met Leu His Glu Gin Glu Leu Tyr 

165 170 175 

Ser Pro Lys Glu Ser Gly Arg Asp Leu Tyr Gly Glu Ser Phe Ser Ala 

180 185 190 

Val Asn Thr Arg Thr Val Lys Phe Pro Leu Arg Gly Asn Thr Arg Ser 

195 200 205 

Ser Gly Glu Leu Gly Thr Val Phe Ser Tyr He Ala Lys Ala Arg Ser 

210 215 220 

Ala Gly Gly Gly Arg Glu Met Ser Leu Ser Ala Asn Gly He Leu He 
225 230 235 240 

Phe Ser Asp Pro Phe Ser Met Thr Ser Asn Glu Val Ser Asn Ser Tyr 

245 250 255 

Leu Ala Gly Lys Lys Arg Arg Leu Tyr His Ser Thr Pro Met Asn Ser 

260 265 270 

Leu Val Asn Glu Leu Arg Leu Asp Ala Asn Tyr Ser Met Thr Gly Asp 

275 280 285 

Ala Val Asn Leu Asp Phe He Glu Val Ala Thr Gin Asn Asp Leu Arg 

290 295 300 

Tyr Asp Gly Ala Pro Met His He Arg Arg Phe Ser Asn Leu Pro Val 
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305 310 315 320 

Leu Gly Gly Glu Ser Cys Arg Phe Val lie Ser Glu Val Pro Glu Ser 

325 330 335 

Leu Val Val Leu Gin Ala Asn Ser Ser Leu Thr Ala Ser Leu Val Pro 

340 345 350 

Val Lys Thr Val Gly Asp Lys Thr lie Glu Phe Val Ala Pro Pro Lys 

355 360 365 

Gly Gin Asp Arg Arg Thr lie Asn Thr Phe Tyr Ala Val Asp Leu Ser 

370 375 380 

Gin Ala Ser Ala Pro Glu lie Leu Gly Ala Val Pro Asn Gin Asn Leu 
385 390 395 400 

His Gly Glu Glu lie Pro Asp Leu lie lie Val Ser Thr Gin Ala Leu 

405 410 415 

Leu Leu Glu Ala Asp Arg Leu Ala Thr Tyr Arg Arg Glu Lys Asn Gly 

420 425 430 

Leu Lys Val Leu Val Val Leu Gin Glu Gin Val Phe Asn Glu Phe Ser 

435 440 445 

Gly Gly Thr Pro Asp Ala Thr Ala Tyr Arg Leu Phe Ala Lys Met Phe 

450 455 460 

Tyr Asp Arg Trp Lys Ala Asn Ala Pro Val Gly Glu Thr Phe Pro Met 
465 470 475 480 

Gin Met Leu Leu Phe Gly Asp Gly Ala His Asp Asn Arg Lys Val Ser 

485 490 495 

Val Ala Trp Gin Lys Pro Tyr Leu Gin Gin Thr Glu Phe La-u Leu Thr 

500 505 510 

Phe Gin Ala Val Asn Ser Thr Asn Val Asn Ser Tyr Val Thr Asp Asp 

515 520 525 

Tyr Phe Gly Leu Leu Asp Asp Gin Pro Ala Ser Val Asn lie Gly Trp 

530 535 540 

Arg Asn Tyr Asn Met Ala Val Gly Arg Phe Pro Val Arg Thr Pro Ala 
545 550 555 560 

Glu Ala Arg lie Ala Val Asp Lys Thr lie Arg Tyr Glu Glu Asp Arg 

565 570 575 

Glu Ser Gly Ala Trp Arg lie Arg Ala Cys Phe Ala Ala Asp Asn Gly 

580 585 590 

Asp Lys His Ala Thr Glu Thr Ser Arg Leu lie Asp Thr Val Lys Arg 
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595 

Tyr Ala Pro Ala 
610 

His Val lie Glu 
625 

Met Leu Glu Thr 

His Gly Gly Pro 
660 

Asp lie His Lys 
675 

Thr Cys Asp Phe 
690 

Glu Val Phe Leu 
705 

Thr Arg Val Val 

Leu Arg Arg Met 
740 

Gly Glu lie He 
755 

Pro Asp Ser He 
770 

Val Arg Met Asn 
785 

Gly Gin Asp Pro 

Glu Arg Val Ala 
820 

Asp Glu Thr Phe 
835 

Lys Lys Met Thr 
850 

Tyr Tyr Asp Tyr 
865 

Asp Gly Leu Phe 



600 

He Met Pro Val 
615 

Asn Gly Leu His 
630 

Leu Gin Ser Gly 
645 

Ala Gly Trp Ser 

Phe Asn Tyr Lys 
680 

Ala Asn Tyr Asp 
695 

His Glu Lys Ser 
710 

Tyr Asn Thr Gin 

725 

Phe Glu Lys Ala 

Arg Ser Ala Lys 
760 

Asn Gin Leu Ser 
775 

Leu Pro Thr His 
790 

Glu Gly Gin Tyr 
805 

Leu Lys Gly Lys 

Ser Gly Lys Val 
840 

Ala Leu Glu Glu 
855 

Pro Asn Val Met 
870 

Glu Thr Ser Phe 



Arg Ala Phe Gin 
620 

Ser He Pro Gly 
635 

He He Leu Leu 
650 

Asp Glu His Leu 
665 

His Met Pro He 

Ser Gin Thr Thr 
700 

Gly Thr Pro lie 
715 

Asn Glu Lys He 
730 

Lys Asp Gly Arg 
745 

Gin Gly Met Leu 

Phe Phe Leu Met 
780 

Lys Val Gin Leu 
795 

Gly Thr He Met 
810 

Val Thr Asp Glu 
825 

Phe Leu Thr Val 

Glu Gly Asn Asp 
860 

Tyr Ala Gly He 
875 

He Val Pro Lys 



605 

Asp Val Tyr Pro 

Ala Lys Lys Lys 
640 

Asn Tyr Ala Gly 
655 

Leu Thr Leu Asn 
670 

Trp He Thr Ala 
685 

Ser Ala Gly Glu 

Met Phe Ser Thr 
720 

Asn Gly Phe Met 
735 

Tyr Arg Thr Met 
750 

Ser Thr Val Phe 
765 

Gly Asp Pro Ser 

Thr ALa He Asn 
800 

Leu Lys Ser Leu 
815 

Lys Gly Thr Phe 
830 

Phe Asp Gly Arg 
845 

Leu Ser Leu Val 

Ala Glu Val Lys 
880 

Asp Val Asn Tyr 
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885 890 895 

Ser Glu His Glu Gly Arg lie Asn Leu Tyr Ala Tyr Asn Glu Ser Thr 

900 905 910 

Lys Ala Glu Ala Met Gly Val Asp Phe Ser lie Arg Val Gin Pro Gly 

915 920 925 

lie Pro Asp Glu Val Thr Glu Asp Asn Thr Pro Pro Glu lie lie Ser 

930 935 940 

Cys Phe Leu Asn Asp Ser Thr Phe Arg Ser Gly Asp Glu Val Asn Pro 
945 950 955 960 

Thr Pro Leu Phe Met Ala Glu Val Phe Asp Leu Asn Gly lie Asn lie 

965 970 975 

Thr Gly Ser Gly Val Gly His Asp lie Thr Leu Cys lie Asp Gly Arg 

980 985 990 

Ala Asp Leu Thr Tyr Asn Leu Asn Ala Tyr Phe Thr Ser Ser Ala Thr 

995 1000 1005 

Asp Ala Gly Val Gly Thr lie Leu Phe Met lie Pro Ala Leu Ala Glu 

1010 1015 1020 

Gly Asp His Thr Ala Arg Leu Thr Val Trp Asp lie Phe Asn Asn Ala 
1025 1030 1035 1040 

Val His His Asp Phe Ser Phe Arg Val Val Asp Gly lie Ala Pro Asp 

1045 1050 1055 

Val Ala Asp Val lie Leu Phe Pro Asn Pro Val Arg Glu Ser Ala Thr 

1060 1065 1070 

Phe Arg lie Phe His Asn Arg Pro Gly Ser Asp Leu Asn VaJ. Ala Val 

1075 1080 1085 

Glu lie Tyr Asp Phe Thr Gly Arg Leu Val Asn Ser Leu Pro Val Lys 

1090 1095 1100 

Thr Tyr Ser Ser Ser Tyr Gly Glu Pro He Glu He Lys Trp Asp Leu 
11°5. 1110 1H5 H20 

Thr Ser Lys Tyr Gly Val Lys He Gly Asn Gly Phe Tyr Leu Tyr Arg 

1125 H30 H35 

Cys Val Val Asn Ser Pro Gly Gly Gin Thr Ala Ser Met Ala Lys Lys 

1140 H45 H50 

Met He Val Val Gly Gin 
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(2) INFORMATION FOR SEQ ID NO: 106 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 391 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



(ix) FEATURE: 

(A) NAME /KEY : mis c_f eature 

(B) LOCATION 1. . .391 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106 



Met lie lie Lys 
1 

Leu Phe Leu Phe 
20 

Val Val His Thr 
35 

Ala Ala Gly Met 
50 

Ser Gin Tyr Trp 
65 

Gly lie Ser Phe 

lie Ala Leu Met 
100 

Asn Gin Ala lie 
115 



Lys Met Leu Lys 
5 

Ala Pro Lys Ala 

Ser Val Pro Ser 
40 

Gly Asp lie Gly 
55 

Asn Pro Ser Lys 
70 

Ser Tyr Thr Pro 
85 

Gin Met Thr Gly 

Ser Ala Ser Leu 
120 



Asn Lys Leu Ala 
10 

Met Lys Ala Gin 

25 

Leu Asn lie Ser 

Val Ala Thr Thr 
60 

Tyr Ala Phe Met 
75 

Trp Leu Ser Lys 
90 

Phe Tyr Lys Leu 
105 

Arg Tyr Phe Thr 



Pro Leu Ala lie 
15 

Glu Gin Leu Asn 

. 30- 
Pro Asp Ala Arg 
45 

Pro Asp Ala Tyr 

Asp Thr Lys Ala 
80 

Leu Val Asn Asp 
95 

Gly Thr Asp Glu 
110 

Leu Gly Lys Leu 
125 



234 



Glu Thr Phe Asp 
130 

Glu Phe Ala Val 
145 

Ser Met Ala Val 

Asn Thr Gly Glu 
180 

Gly Tyr Leu Gin 
195 

Ser Leu Gly Phe 
210 

Gly Gly Val Thr 
225 

Gly Leu Leu Tyr 

Glu Leu Ser Lys 
260 

Asp Gin Ala Gly 
275 

Ser lie Ser Gly 
290 

Lys Glu Glu Phe 
305 

Tyr Asp Asp Lys 

Thr Lys Gly Asn 
340 

Asn lie Phe Arg 
355 

Asn Pro Leu Asp 
370 

Gly Leu Arg Asn 
385 



Glu Leu Gly Glu 
135 

Asp Leu Gly Tyr 
150 

Ala Leu Arg Tyr 
165 

Asn Gin Ala Gly 

Lys Tyr Val Leu 
200 

Asn Val Lys Asn 
215 

Ser Phe Phe lie 
230 

Pro lie Asp Asp 
245 

Leu Leu Val Pro 

Tyr Glu Ala Ala 
280 

lie Phe Ser Ser 
295 

Arg Glu lie Thr 
310 

Phe Phe Val Arg 
325 

Leu Gin Tyr Phe 

lie Asp Ala Ser 
360 

Gin Thr Leu Arg 

375 

Leu Phe His 
390 



Ser Met Gly Glu 
140 

Ser Arg Gin Leu 
155 

lie Arg Ser Asp 
170 

Asn Ala Phe Ala 
185 

Leu Gly Asn Ala 

lie Gly Thr Lys 
220 

Pro Thr Ser Leu 
235 

Tyr Asn Ser lie 
250 

Thr Pro Pro lie 
265 

Leu Lys Lys Tyr 

Phe Gly Asp Ala 
300 

Trp Gly Leu Gly 
315 

Ala Gly Tyr Ser 
330 

Thr Ala Gly Ala 
345 

Tyr Leu Leu Ser 

Phe Thr Leu Ala 
380 



Ala His Pro Asn 

Ser Glu Asn Phe 
160 

Gin Ser Thr His 
175 

Ala Asp He Ala 
190 

Glu Ser Leu Trp 
205 

He Ser Tyr Asp 

Asn Leu Gly Thr 
240 

Asn Phe Asn Leu 
255 

Met Asp Gin Asn 
270 

Gin Glu Thr Ser 
285 

Pro Gly Gly Leu 

Ala Glu Tyr Ser 
320 

Tyr Leu His Pro 
335 

Gly Phe Lys Met 
350 

Thr He Gin Ser 
365 

Phe Asp Met Asp 



(2) INFORMATION FOR SEQ ID NO: 107 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 204 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: mis cofeature 

(B) LOCATION 1 . . .204 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107 

Met Lys Lys Met lie Leu Ala Ala Thr Met Leu Leu Ala Thr lie Gly 

15 10 15 

Phe Ala Asn Ala Gin Ser Arg Pro Ala Leu Arg Leu Asp Ala Asn Phe 

20 25 30 

Val Gly Ser Asn Leu Met Gin Lys Val Ala Asn Thr Ser Val Asn Asn 

35 40 45 

Lys Met lie Val Gly Leu Arg Val Gly Ala Ala Ala Glu Phe Ala Leu 

50 55 60 

Ser Asn Asp Gly Phe Tyr Leu Ala Pro Gly Leu Ala Tyr Thr Met Arg 
65 70 75 80 

Gly Ala Lys Met Glu Ser Leu Ser Glu Thr Thr Thr Arg Leu His Tyr 

85 90 95 

Leu Gin lie Pro Val Asn Ala Gly Met Arg Phe Ser Phe Ala Asp Asn 

100 105 110 

Met Ala lie Ser Leu Glu Ala Gly Pro Tyr Phe Ala Tyr Gly Val Ala 

115 120 125 

Gly Thr He Lys Thr Lys Val Ala Gly Val Thr Ala Ser Val Asp Ala 
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130 135 140 

Phe Gly Asp Asn Gly Tyr Asn Arg Phe Asp Leu Gly Leu Gly Leu Ser 
145 150 155 160 

Ala Ala Leu Ser Tyr Asp Arg Tyr Tyr Val Gin lie Gly Tyr Glu His 

165 170 175 

Gly Leu Leu Asn Met Leu Lys Asp Ala Pro Asp Lys Thr Ser Leu Arg 

180 185 190 

Asn His Asp Phe Phe Val Gly Leu Gly Val Arg Phe 
195 200 

(2) INFORMATION FOR SEQ ID NO: 108 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 455 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1. . .455 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108 

Met Lys Arg Thr He Leu Leu Thr Ala Leu Thr Val Leu Ser Ser Leu 

1 5 10 15 

Ser Leu Leu Arg Ala Gin Asn Glu Ser Glu Ala Ser Thr Asn Pro Met 

20 25 30 

Ser Gly Leu Ser Leu Glu Asp Cys He Arg He Ala Lys Glu Arg Asn 

35 40 45 
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Leu Asn Leu Arg Arg Gin Glu lie Glu Gin Glu Asn Arg lie lie Ser 

50 55 60 

Leu Asp Ala Ala Arg His Ser Phe Leu Pro Ser Val Asn Ala Gly lie 
65 70 75 80 

5 Gly His Asn Tyr Ser Phe Gly Arg Ser Lys Asp Lys Thr Gly Val Thr 

85 90 95 

Val Asp Arg Ser Ser Met Asn Thr Asn Leu Ser lie Gly Ala Ser Val 

100 105 110 

Glu Val Phe Ser Gly Thr Arg Arg Leu His Asp Leu Lys Gin Gin Lys 
10 115 120 125 

Tyr Asn Val Glu Asp Gly lie Ala Arg Leu Gin Lys Ala Arg Glu Asp 

130 135 140 

Leu Ser Leu Gin lie Ala Ala Leu Tyr lie Asn Leu Leu Phe Arg Gin 
145 150 155 160 

15 Glu Met Thr Arg Thr Ala Glu Thr Gin Leu Ala Leu lie Arg Glu Gin 

165 170 175 

Arg Asn Arg Thr Ala Glu Met Val Arg Val Gly Lys Trp Ala Glu Gly 

180 185 190 

Lys Leu Leu Asp lie Asn Ala Gin Met Ala Lys Asp Glu Gin Leu Leu 
20 195 200 205 

Val Gin Tyr Arg Ser Glu Glu Glu Leu Ala Arg Leu Asp Leu Gly Gin 

210 215 220 

Ala Leu Glu Leu Glu His Pro Glu Ser lie Ala Val Lys Ala Pro Asp 
225 230 235 . . 240 

25 Thr Asp Val Leu Val Ala Glu Arg Leu Gly Ser Leu Leu Ala Pro Glu 

245 250 255 

Glu lie Tyr Arg Thr Ala Leu Gly Leu Lys Pro Ala Leu His Ser Ser 

260 265 270 

Glu Leu Gin lie Ala Ser Ala Arg Glu Gly Leu Ala Ser Ala Arg Ala 
30 275 280 285 

Ala Tyr Phe Pro Thr Leu Ser Leu Ser Ala Gly Tyr Ser Asn Gly Tyr 

290 295 300 

Phe Arg Asp Leu Gly Lys Glu Tyr Ala Ala lie Asn Pro Ser Phe Ser 
305 310 315 320 

35 Glu Gin Trp Lys Asn Asn Gly Ser Tyr Ser lie Gly Leu Ser Leu Asn 

325 330 335 
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lie Pro lie Phe Ser 
340 

Arg Leu Gin lie Arg 
355 

Ala Leu Tyr Lys Glu 
370 

Asp Lys Ala lie Ala 
385 

Ala Tyr Glu Tyr Ala 
405 

Tyr Glu Tyr Ala Glu 
420 

Glu Leu Arg Ala Lys 
435 

Phe Tyr Gin Gly Lys 
450 



Ala Met Gin Thr Gin Asp 
345 

Ser Ser Glu Leu Arg Leu 
360 

He Arg Gin Ala Tyr Ser 
375 

Ala Ala Glu Asn Ser Lys 
390 395 
Arg Asp Ser Phe Glu Ala 
410 

Ala Lys Thr Lys Tyr Ala 
425 

Tyr Asp Phe He Tyr Lys 

440 
Asp Phe 
455 



Arg Val Arg Ser Ser 
350 

Val Glu Glu Lys Lys 
365 

Asn Ala Val Ala Ala 
380 

Ala Ala Thr Leu Lys 
400 

Gly Arg Leu Ser Ala 
415 

Leu Ser Gin Val Glu 
430 

Ala Lys Val Leu Asp 
445 



(2) INFORMATION FOR SEQ ID NO: 109 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 34 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1. . . 434 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109 
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Met Tyr Lys Asp Tyr Lys Gly Leu Tyr Ala Ser Leu Arg Trp Tyr Ala 
15 10 15 

Leu lie lie Gly Leu Leu Phe Ala Ala Asp Gly lie Gin Ala Gin Asn 
5 20 25 30 

Asn Asn Phe Thr Glu Ser Pro Tyr Thr Arg Phe Gly Leu Gly Arg Leu 

35 40 45 

Gly Glu Arg Thr Thr lie Ser Gly His Ser Met Gly Gly Leu Gly Val 
50 55 60 

10 Gly Leu Arg Gin Gly Thr Tyr Val Asn Ala Val Asn Pro Ala Ser Tyr 
65 70 75 80 

Ser Ala Val Asp Ser Met Thr Phe lie Phe Asp Phe Gly Ala Ser Thr 

85 90 95 

Gly lie Thr Trp Tyr Ala Glu Asn Gly Lys Lys Asp Asn Arg Lys Met 
15 100 105 110 

Gly Asn lie Glu Tyr Phe Ala Met Leu Phe Pro lie Ser Lys Ser lie 

115 120 125 

Ala Met Ser Ala Gly Val Leu Pro Tyr Ser Ala Ser Gly Tyr Gin Phe 
130 135 140 

20 Gly Ser Val Asp Gin Val Glu Gly Gly Ser Val Gin Tyr Thr Arg Lys 
145 150 155 160 

Tyr Leu Gly Thr Gly Asn Leu Asn Asp Leu Tyr Val Gly lie Gly Ala 

165 170 175 

Thr Pro Phe Lys Asn Phe Ser lie Gly Ala Asn Ala Sex Ser Leu Phe 
25 180 185 190 

Gly Arg Phe Thr His Ser Arg Gin Val lie Phe Ser Thr Glu Ala Pro 

195 200 205 

Tyr Asn Pro Val His Leu Ser Thr Leu Tyr Leu Lys Ala Ala Lys Phe 
210 215 220 

30 Asp Phe Gly Met Gin Tyr His Leu Leu Leu Lys Ser Asp Arg Ser Leu 
225 230 235 240 

Val lie Gly Ala Val Tyr Ser Pro Arg Val Lys Met His Ser Glu Leu 

245 250 255 

Thr Gin lie Lys Asn Gin Val Gin Asn Gly Val Val Val Glu Ser Glu 
35 260 265 270 

Thr Gin Glu Tyr lie Lys Gly Met Asp Tyr Tyr Thr Leu Pro His Thr 
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275 280 285 

Leu Gly He Gly Phe Ser Tyr Glu Lys Lys Asp Lys Leu Leu Leu Gly 

290 295 300 

Ala Asp Val Gin Tyr Ser Lys Trp Lys Gly Glu Lys Phe Tyr Lys Ser 
305 310 315 320 

Asp Cys Lys Phe Gin Asp Arg He Arg Val Ser Leu Gly Gly Glu He 

325 330 335 

He Pro Asp He Asn Ala Val Gly Met Trp Pro Lys Val Arg Tyr Arg 

340 345 350 

Phe Gly Leu His Gly Glu Asn Ser Tyr Leu Lys Val Pro Thr Lys Gly 

355 360 365 

Gly Val Tyr Gin Gly Tyr His He Val Gly Ala Val Phe Gly He Gly 

370 375 380 

He Pro Leu Asn Asp Arg Arg Ser Phe Val Asn Val Ser Leu Glu Tyr 
385 390 395 400 

Asp Arg Leu He Pro Lys Glu Gly Met He Lys Glu Asn Ala Leu Lys 

405 410 415 

Leu Thr Phe Gly Leu Thr Phe Asn Glu Ser Trp Phe Lys Lys Leu Lys 
420 425 

Leu Asn 



430 



(2) INFORMATION FOR SEQ ID NO: 110 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 
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(ix) FEATURE: 

(A) NAME/ KEY : mis c_f eature 

(B) LOCATION 1 . . . 926 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110 



Met Arg Ser lie 
1 

Gly Phe Val Gly 
20 

Leu Asp Glu Glu 
35 

Gin Ser Thr Gly 
50 

Gly Leu Phe Ser 
65 

Val Ser Tyr Val 

Asn Gly Gin Thr 
100 

Arg Leu Leu Gin 
115 

Val Arg Asn Asp 
130 

Gin Gly Ala Ser 
145 

lie Gly Ser Asp 

lie Leu Val Asp 
180 

lie Lys Asn Leu 
195 

Lys Leu Ser Glu 
210 

Glu Thr Val lie 
225 



Tyr Gin Leu Leu 
5 

Leu Glu Ala Gin 

Gly Asn Pro Met 
40 

Gin Val Ala Val 
55 

Leu Lys Thr Ser 
70 

Gly Tyr Thr Thr 
85 

lie Thr Leu Lys 

Ser Val Thr Val 
120 

Thr Leu Glu Phe 
135 

lie Glu Glu Leu 
150 

Gly Lys He Thr 
165 

Gly Lys Glu Phe 

Pro Ala Asp Met 
200 

Leu Ser Arg Met 
215 

Asn Leu Thr Val 
230 



Leu Ser He Leu 
10 

Gin Ala Gly Val 

25 

He Gin Ala Asn 

Ala Ala Gly Ala 
60 

Gin Glu Gly Asp 
75 

His Asp Glu Lys 
90 

Asp He Ser Met 
105 

Gin Ala Lys Ala 

Asn Ala Gly Ser 
140 

He Lys Lys Leu 
155 

He Asn Gly Lys 
170 

Phe Ser Lys Asp 
185 

Val Asn Lys Val 

Ser Gly Phe Asp 
220 

Lys Pro Glu Lys 
235 



Leu Ala Ser Leu 
15 

Ala Gly Arg Val 
30 

Val Gin Leu Val 
45 

Thr Asn Glu Lys 

Tyr He Leu Arg 
80 

He Ser Leu Arg 
95 

Asn Glu Asp Ala 
110 

Ala Glu Val Val 
125 

Tyr Thr Val Ala 

Pro Gly Ala Glu 
160 

Asp He Ser Lys 
175 

Pro Gin Val Ala 
190 

Gin Val Leu Asn 
205 

Asp Gly Glu Glu 

Lys Lys Gly Leu 
240 
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Phe Gly Thr Leu Gin Ala Gly Tyr Gly Thr Asp Gin Arg Tyr Met Ala 

245 250 255 

Gly Gly Asn Val Asn Arg Phe Asp Gly Asn Lys Gin Trp Thr Leu lie 

260 265 270 

Gly Ser Ala Asn Asn Thr Asn Asn Met Gly Phe Ser Glu Met Asp Ser 

275 280 285 

Glu Met Gly Ser Met Thr Phe Phe Ser Pro Gin Gly Gly Gly Arg Arg 

290 295 300 

Gly Phe Gly Asn Ser Gly Gly Val Thr Ser Ser Ser Met Leu Gly Gly 
305 310 315 320 

Asn Phe Ser Val Glu Phe Ser Ser Ala Leu Asn Thr Gly Gly Asp Ala 

325 330 335 

Arg Tyr Gly Tyr Asn Asp Lys Ala He Glu Thr Thr Lys Arg Val Glu 

340 345 350 

Asn He Leu Ala Glu Gly Asn Thr Tyr Met Asp Glu Asn He Leu Glu 

355 360 365 

Arg Ser Phe Ser His Asn Gly Gin Ala Arg Phe Arg Met Gin Trp Lys 

370 375 380 

Pro Ser Glu Arg Thr Glu Val Val Phe Glu Pro Asp Leu Ser lie Ser 
385 390 395 400 

Lys He Asp Gly Phe Phe Asn Asp Thr Tyr Glu Thr Lys Asp Ala Thr 

405 410 
Gly He Ser He Asn Lys Gly Ser He His Gin Thr Thr Gin Gly Asn 

420 425 43U0 

Asn Phe Arg Leu Asn Gly Glu Leu Asp He Ser His Lys Leu Asn Asp 

435 440 445 

Glu Gly Arg Thr He Ser Ala Ser Val Ser Gly Gly Leu Thr Asp Glu 

450 455 46Q 

Asp Gly Asp Gly He Tyr Gin Ala Val Leu Gin Ser Val Glu Thr Asn 
465 470 475 480 

Gin Lys Gin Phe Asn Asp Asn Ser Asn Leu Gin Tyr Arg Leu Arg Leu 

485 490 495 

Ser Tyr Val Glu Pro Leu Gly Lys Asn Tyr Phe Ala Gin Ala He Leu 

500 505 510 

Asn Arg Arg Phe Ser Arg Arg Asn Ser Asp Arg Glu Val Tyr Arg Leu 
515 520 525 
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Gly Asp Asp Gly 
530 

Tyr Ser Asn Glu 
545 

lie Ala Lys Thr 

Arg Thr Val Ser 
580 

Ala Phe Asn Arg 
595 

Pro Ser Arg Thr 
610 

Gin Pro Ser lie 
625 

Leu Phe Val Thr 

Asn Val Met Ala 
660 

Phe Asn lie Val 
675 

Asn Thr His Tyr 
690 

Asn Ala Ser Gly 
705 

Pro Leu Lys Asn 

Leu Ala Glu Gly 
740 

Ser Phe Arg Thr 
755 

lie Asp Thr Ser 
770 

Ser Leu Ser Gly 
785 

Tyr Gin Val Ala 



Gin Tyr Ser lie 
535 

Phe Thr Gin Tyr 
550 

Trp Asp Tyr Thr 
565 

Tyr Arg Ser Val 

Val Asn Leu Ser 
600 

Thr Asn Leu Arg 
615 

Asn Gin lie Ala 
630 

Glu Gly Asn Pro 
645 

Met Phe Ser Asp 

Phe Phe Gly Asn 
680 

Asp Pro Ser Thr 
695 

Thr Trp Gin Ala 
710 

Arg Ala Phe Ser 
725 

Gin Ser Phe lie 

Arg Glu Arg Leu 
760 

lie Gly Gly Asn 
775 

Gin Lys Asp Ser 
790 

Leu Thr Leu Pro 
805 



Leu Asp Ser Gin 
540 

Arg lie Gly Leu 
555 

Val Gly Phe Asn 
570 

Ala Gly Val Glu 
585 

Pro Met Leu Arg 

Val Asp Tyr Arg 
620 

Pro Val Gin Asp 
635 

Gly Leu Lys Pro 
650 

Phe Asp Ala Lys 
665 

Tyr Thr Phe Asp 

Gly lie Arg Thr 
700 

Asn Leu His Gly 
715 

Phe Arg Met Ser 
730 

Asn Asp Asp Lys 
745 

Thr Leu Thr Tyr 

lie Gly Phe Tyr 
780 

Arg Thr Tyr Asp 
795 

Tyr Gly Phe Arg 
810 



Tyr Gly Leu Ser 

Asn Leu Lys Lys 
560 

Val Asp Pro Asn 
575 

Gin Asp Lys Leu 
590 

lie Asn Tyr Lys 
605 

Gly Arg Thr Thr 

lie Thr Asn Pro 
640 

Ser Tyr Ser Asn 
655 

Ser Gin Arg Ala 
670 

Asp lie Val Pro 
685 

Thr Arg Tyr Glu 

Thr Leu Ser Leu 
720 

Leu Phe Asn Arg 
735 

Asn Lys Ala Leu 
750 

Arg Asn Asn Trp 
765 

Met Ala Asn Asn 

Phe Gly Gly Asn 
800 

lie Asp Ser Asp 
815 
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Val Glu Tyr Asn Thr Asn Ser Gly Tyr Ser Gly Gly Phe Ser Leu Asp 

820 825 830 

Glu Trp Leu Trp Asn Ala Ser Leu Ser Tyr Ser Phe Leu Arg Asp Lys 

835 840 845 

Ala Gly Thr Leu Arg Val Asn Gly Tyr Asp lie Leu Gly Gin Arg Ser 

850 855 860 

Ser lie Ser Arg Ser Ala Ser Ala lie Asn lie Glu Glu Ser Met Ser 
865 870 875 880 

Asn Thr lie Gly Arg Tyr Val Met Val Asp Phe lie Tyr Arg Phe Asn 

885 890 895 

Ala Phe Ser Gly Gly Gly Ser Arg Ser Asp His Gin Arg Gly Asn Met 

900 905 910 

Asn Arg Pro Gly Pro Pro Phe Gly Gly Gly Arg Arg Pro Ser 
915 920 925 

(2) INFORMATION FOR SEQ ID NO: 111 

(i) SEQUENCE -CHARACTERISTICS : 

(A) LENGTH: 781 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 
<B) LOCATION 1 ... 781 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111 

Met Asp Arg Pro Lys Pro Ser Tyr lie Val Arg lie Ala Ala lie Leu 
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1 

Cys Leu Phe Val 
20 

Val Asp Pro Leu 
35 

Asn Thr Tyr Pro 
50 

Pro Met Thr Gly 
65 

His Lys lie Arg 

Asn Asp Tyr Gly 
100 

Pro Ser Ser Asn 
115 

Phe Ser Asp Glu 
130 

Thr Pro Tyr Tyr 
145 

Glu Met Ala Pro 

Gly Asn Thr Glu 
180 

Thr Gly Gly Ser 
195 

lie Ser Arg Lys 
210 

Phe lie Leu Gin 
225 

Asp Thr Gly Lys 

Phe Asp Ser Gin 
260 

Val Glu Gin Ala 
275 

Asp Arg lie Arg 



5 

Gly Arg Pro Leu 

lie Gly Thr Leu 
40 

Val lie Gly Leu 
55 

Val Pro Gly Asp 
70 

Gly Phe Lys Gin 
85 

Gin Phe Ser Leu 

Asp Ser lie Ala 
120 

Gin Thr Ser Trp 
135 

Tyr Ser Val Tyr 
150 

Thr Glu Arg Ala 
165 

Ser Gly Ser Gly 

Glu lie Ser lie 
200 

Asn Ser Gly Gly 
215 

Ser Asp Thr Pro 
230 

Ser Asp Glu Gly 
245 

Glu Val Thr Val 

Glu Arg Asn Leu 
280 

Leu Ala Gly Arg 



10 

Phe Ala Gin Ser 
25 

Ser Ser Phe Glu 

Pro Trp Gly Met 
60 

Gly Trp Gin Tyr 
75 

Thr His Gin Pro 
90 

Leu Pro Leu Thr 
105 

Leu Thr Lys Trp 

Phe Ser His Lys 
140 

Leu Ala Asp Tyr 
155 

Ala lie Phe Arg 
170 

Arg Trp Leu Arg 
185 

Val Asp Pro His 

Val Pro Ala Asn 
220 

Met Ala Asp Val 
235 

Thr Arg Ala Trp 
250 

Arg Val Ala Ser 
265 

Ala Glu Val Lys 
Glu Ala Trp Asn 



15 

Tyr Val Asp Tyr 
30 

Leu Ser Ala Gly 
45 

Asn Ser Trp Thr 

Thr Tyr Ser Ala 
80 

Ser Pro Trp lie 
95 

Ala Pro Gin Lys 
110 

Cys Lys Gin Leu 
125 

Ala Glu Thr Ala 

Asp Thr Arg Val 
160 

lie Arg Tyr Ser 
175 

Leu Asp Ala Phe 
190 

Thr- Vat Val Gly 
205 

Phe Ala Cys Tyr 

Leu Leu Glu Thr 
240 

Ala Ala Cys Arg 
255 

Ser Phe lie Ser 
270 

Gly Gin Ser Phe 
285 

Lys Val Leu Gly 



290 

Arg lie His Val 
305 

Ser Ala Leu Tyr 

Asp Ala Ser Gly 
340 

Leu Pro Gly Tyr 
355 

Ala Leu Phe Pro 
370 

lie Gin Glu Gly 
385 

Glu Trp Ala Ser 

Ala Ser Val Leu 
420 

Thr Arg Thr Leu 
435 

Pro Lys lie Ser 
450 

Leu Gly Tyr Val 
465 

Thr Leu Glu Tyr 

Thr Leu Gly Trp 
500 

Met Asn Tyr Arg 
515 

Arg Asn Gin Asp 
530 

Gly Asp Val Phe 
545 

Phe His Asp Val 
Phe Val Ser Met 



295 

Glu Gly Gly Thr 
310 

Arg Cys Leu Leu 
325 

Asn Phe Val His 

Leu Tyr Thr Asp 

360 

Leu Leu Asn Leu 
375 

Leu Leu Asn Val 
390 

Pro Gly His Arg 
405 

Ala Asp Ala Tyr 

Met Asn Gly Leu 
440 

Ser Thr Gly Arg 
455 

Pro Ala Asp Ala 
470 

Ala Tyr Asn Asp 
485 

Asp Arg Ala Ala 

His Leu Phe Asp 
520 

Gly Ser Phe Arg 
535 

Thr Glu Gly Asn 
550 

Gin Gly Leu He 
565 

Leu Asp Ser Val 
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300 

Lys Asp Glu Arg 
315 

Phe Pro Arg Arg 
330 

Tyr Ser Pro Tyr 
345 

Thr Gly Phe Trp 

Leu Tyr Pro Asp 
380 

Tyr Arg Glu Ser 
395 

Asp Cys Met He 
410 

Leu Lys Gly Val 
425 

Leu His Ala Thr 

Lys Gly Trp Glu 
460 

Gly He Asp Glu 
475 

Trp Cys He Leu 
490 

Leu Asp Thr Leu 
505 

Pro Glu Thr Lys 

Thr Pro Phe Ser 
540 

Ala Trp His Tyr 
555 

Asp Leu Met Gly 
570 

Phe Asn Thr Pro 



Thr Thr Phe Tyr 
320 

Phe Tyr Glu Glu 

335 

Asn Gly Glu Val 
350 

Asp Thr Phe Arg 

365 

Glu Asn He Lys 

Gly Phe Phe Pro 
400 

Gly Asn Asn Ser 
415 

Arg Val Glu Asp 
430 

Lys Ala Val His 
445 

Trp Tyr Asn Ser 

Ser Ala Ala Arg 
480 

Arg- Leu Gly Arg 
495 

Ala His Arg Ser 
510 

Leu Met Arg Gly 
525 

Pro Phe Lys Trp 

Thr Trp Ser Val 
560 

Gly Asp Arg Pro 
575 

Pro Met Phe Asp 
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580 

Glu Ser Tyr Tyr 
595 

Ala Asp Met Gly 
610 

lie Tyr Leu Tyr 
625 

Leu Arg Glu Val 

Cys Gly Asp Glu 
660 

Ala Leu Gly Phe 
675 

Gly Ser Pro lie 
690 

Lys Thr Val Leu 
705 

Arg Ser lie Ser 

His Glu Gin Leu 
740 

Lys Pro Asn Tyr 
755 

Phe Ser Thr Glu 
770 



Gly Phe Val lie 
600 

Asn Tyr Ala His 
615 

Asn His Ala Gly 
630 

Met Gly Arg Leu 
645 

Asp Asn Gly Gin 

Tyr Pro Val Thr 
680 

Phe Ser Lys Val 
695 

His Ala Pro Ala 
710 

Val Glu Gly Lys 
725 

Arg Ser Ser Ala 

Asn Arg Gly Met 
760 

Gin Gin Arg Arg 
775 



585 

His Glu lie Arg 

Gly Asn Gin Pro 
620 

His Pro Trp Lys 
635 

Tyr Arg Pro Thr 
650 

Thr Ser Ala Trp 
665 

Pro Ala Thr Asp 

lie Leu Ser Phe 
700 

Asn Ser Ala Asp 
715 

Glu Trp Ser Cys 
730 

Ser lie Gin Trp 
745 

Lys Glu Ser Asp 

Ala Asn His Ser 
780 



590 

Glu Met Gin lie 
605 

lie Gin His Met 

Ala Gin Glu Arg 
640 

Pro Asp Gly Tyr 
655 

Tyr Val Phe Ser 
670 

Gin Tyr Val Leu 
685 

Pro Asp Gly His 

Thr Pro Tyr lie 
720 

Asn Tyr Leu Thr 
735 

Met Met Asp Thr 
750 

Arg Pro Tyr Ser 

765 

Asn 



(2) INFORMATION FOR SEQ ID NO: 112 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: YES 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



5 (ix) FEATURE : 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1 . . . 446 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112 

10 

Met Lys Thr Thr Val Gin Gin He He Leu Cys Leu Ala Leu Met Met 
15 10 15 

Ser Gly Val Leu Gly Gly Asn Ala Gin Ser Phe Trp Glu Glu He Ala 
20 25 30 

15 Pro Pro Phe He Ser Asn Glu Pro Asn Val Lys Tyr He He Pro Asn 
35 40 45 

Met Gly He Asp Ser Lys Gly Thr He Tyr Val Thr Val Thr Lys Arg 

50 55 60 

He Gin Gin Gly Ala Asn Tyr Thr Ser Glu Gin Leu Gly Met Tyr Tyr 
20 65 70 75 80 

Arg Pro Leu Gly Asp Asn Glu Gin Trp Trp Lys His Asp Pro Tyr Phe 

85 90 95 

Asp Asp Lys He Val Ala Asp He Gin Thr Asp Ala Tyr Gly Arg Val 
100 105 - ne 

25 Tyr Val Cys Thr Thr Ser Ser Arg Asp Gin Glu Tyr Gin Leu Tyr He 
115 120 125 

Asn Glu Gin Asn Glu Trp Arg Cys He Phe Lys Thr Ser Val Ser Thr 

130 135 140 

Tyr Glu His Gly Met Ala Val Phe Arg Ser Ser Thr Gly Val Thr Tyr 
30 145 150 155 160 

He Gly Thr Arg His His He Phe Ala Ser Gly Val Asn Asp Phe Glu 

165 170 175 

Phe Asn Thr He Tyr Glu Asp Ser Thr Pro Met Ser Cys Arg Phe Ala 
180 185 190 

35 Glu Ala Thr Asn Ser Gly Thr He Tyr Leu Ala Leu Met His Glu Thr 
195 200 205 
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Thr Met Ser Thr 
210 

He Ser Glu Ser 
225 

Asn Lys Glu Gly 

Met Ser Gly Thr 
260 

Leu Val Gly Gly 
275 

Met Asp Asp Asn 
290 

Val Asp Gly Arg 
305 

Phe Glu Trp Tyr 

Phe Phe Tyr Ser 
340 

He Met Leu Arg 
355 

Thr Val Val Gin 
370 

He Glu Ser Glu 
385 

Gly Arg Met Val 

He Asp Thr Asn 
420 

Arg Leu Ser Ser 
435 

(2) INFORMATION 



Thr He Leu Thr 
215 

Glu Leu Ser Asn 
230 

Asp He He Ala 
245 

Leu Ala He Arg 

Asp He Gin Asn 
280 

Lys He Ala Cys 
295 

Thr Arg Val Cys 
310 

Glu Asp Glu He 
325 

Pro Trp Asp Lys 

Ser Lys Glu Ser 
360 

Gly Val Asp Val 
375 

Thr Pro Val Ser 
390 

Leu Arg Gin Thr 
405 

Gly Leu Lys Arg 

Gly Gin Val Phe 
440 

FOR SEQ ID NO: 1] 



Tyr Gin Asn Gly 
220 

Ser He He Ala 
235 

Leu Val Thr Ser 
250 

Lys Ala Asp Glu 
265 

Ala He Val Gin 

Glu Val Phe Gly 
300 

Val Ser Asp Ala 
315 

Tyr Gly Gly Leu 
330 

Leu Leu Tyr Ala 
345 

Phe He Thr Ser 

Tyr Thr Leu Ala 
380 

Glu Val Leu Leu 
395 

He Asp Asn Lys 
410 

Ser Gly lie Tyr 
425 

Ser His Lys Val 
3 



Glu Phe Val Asp 

Ser Met Cys Ser 
240 

Tyr Thr Gly Phe 

255 

Gly Lys Trp Gin 
270 

Asn He Cys Met 
285 

Thr Pro Asn Gly 

Ser Val Phe Asp 
320 

He Phe Asp Thr 
335 

Lys Phe Gly Gly 
350 

Phe He Ser Pro 
365 

Gly Lys He Arg 

Phe Asp Leu Ala 
400 

He Tyr Ser Asp 
415 

Val Val Ser Val 

430 
Gin Val 
445 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 308 amino acids 

(B) TYPE: amino acid 
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( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

5 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

10 (ix) FEATURE: 

(A) NAME/ KEY : mis c_f eature 

(B) LOCATION 1. . .303 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113 



Met lie lie Arg Cys Leu lie Arg Arg Pro Arg Thr Val Leu Phe Gly 
15 10 15 

Leu lie Phe Val Val Gly Leu Phe Ser Ala Met Ala Gin Glu Lys Lys 
20 25 30. 

20 Asp Ser Leu Ser Thr Val Gin Pro Val Pro Asn Ser Ser Met Val Glu 
35 40 45 

Gin Thr Pro Leu Leu Ser lie Asp His Pro Val Leu Pro Ala Ser Phe 

50 55 60 

Gin Asn Thr Arg Thr Leu Lys Arg Phe Arg Asp Lys His- Leu Ser Asp 
25 65 70 75 80 

Ala Leu Leu Asn Gly Leu Lys Pro His Arg Ser Ser Leu Gin Leu Asn 

85 90 95 

Glu Glu Leu Asn Phe Ala Ala Glu Arg Arg Asp Phe Val Ser Pro Leu 
100 105 110 

30 Leu Gin Thr Arg His Ala Ala Gly Val Leu Ser Trp Arg Pro Thr Asp 
115 120 125 

Arg Met His Phe Tyr Thr Ser Gly Asn lie Gly Leu Gly His Asp Leu 

130 135 140 

Leu Thr Gly Val Arg Lys Asp Phe Gly Trp Asn Ala Gly Ala Asp Phe 
35 145 150 155 160 

Leu Leu Ser Gin Asn Leu Thr Ala His Val Gin Gly Gly Trp Gin Gin 



251 



165 170 175 

Asn Phe Gly Phe lie Pro Met Thr Ala Val Asn Gly Gin Leu Arg Trp 

180 185 190 

Gin Ala Thr Glu Arg Leu Ser Phe Thr Thr Gly lie Asp Tyr Arg Gin 
5 195 200 205 

Val Gin Trp Asn Ala Phe Asp Asn Arg Thr Phe Ser Leu Lys Gly Ser 

210 215 220 

Ala Arg Tyr Glu Val Met Asp Asn Val Phe Val Asn Gly Phe Gly Ser 
225 230 235 240 

10 Tyr Pro Leu Tyr Ser Ser Thr Arg Ser Gly Leu Asn Met Ala Val Pro 

245 250 255 

Met His Gly Phe Gly Pro Gin Tyr Gly Gly Ser Leu Glu Leu Lys Val 

260 265 270 

Ser Glu Arg Phe Gly Phe Ala Val Gly Met Glu Arg Glu Tyr Asn lie 
15 275 280 285 

Trp Thr Arg Arg Trp Glu Thr His Tyr Phe Ala Tyr Pro Val Phe Tyr 

290 295 300 

Gly Asp Lys Lys 
305 



20 



(2) INFORMATION FOR SEQ ID NO: 114 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 3 amino acids 
25 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

30 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

35 (ix) FEATURE: 

(A) NAME/KEY: misc feature 



4 



252 



(B) LOCATION 1. . .243 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114 

5 Met Lys Arg lie Phe Thr Val Ala Leu Val Leu Leu Ala Ser Val Thr 
15 10 15 

Met Ala lie Gly Gin Ser Arg Pro Ala Leu Arg Val Asp Ala Asn Phe 

20 25 30 

Val Gly Ser Asn Gin Ser Met Lys Arg Asp Gly Tyr Val Trp Asp Thr 
10 35 40 45 

Lys Met Asn Val Gly Leu Arg Val Gly Ala Ala Ala Glu Phe Met lie 

50 55 60 

Gly Ser Arg Gly Phe Tyr Leu Ala Pro Gly Leu Asn Tyr Thr Met Lys 
65 70 75 80 

15 Gly Ser Lys Thr Glu Trp Asp lie Pro Glu Met Val Pro Gly Thr Tyr 

85 90 95 

lie Thr Met Val Ser Thr Arg Leu His Tyr Leu Gin Leu Pro lie Asn 

100 105 110 

Ala Gly Met Arg Phe Asp Leu Met Asn Asp Met Ala Val Ser lie Glu 
20 115 120 125 

Ala Gly Pro Phe Leu Ala Tyr Gly lie Tyr Gly Thr Tyr Arg Gin Lys 

130 135 140 

Leu Glu Gly Trp Lys Pro Asn Asn Tyr Ser Thr Glu Phe Phe Gly Pro 
145 150 155 - - 160 

25 Thr Leu Gly Gly Pro Thr Asn lie Arg Trp Asp lie Gly Ala Asn lie 

165 170 175 

lie Ala Ala Phe His Tyr Lys Arg Tyr Tyr lie Gin He Gly Tyr Glu 

180 185 190 

His Gly Phe Val Asp He Val Ser Gly Gly Gly Ser Asp He Pro Arg 
30 195 200 205 

Leu Asn Asp Asn Arg Gin Ser Ser Ser Thr Thr Ala Leu Arg Glu Lys 

210 215 220 

Gly Asn Asn Glu Tyr Ala Tyr Asn Arg Asp Phe Phe Val Gly He Gly 
225 230 235 240 

35 Tyr Arg Phe 
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10 



15 



(2) INFORMATION FOR SEQ ID NO: 115 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1 ... 540 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115 

Met Lys Thr Lys Val Leu Arg Lys Phe Val Val Ala Ala Phe Ala Val 

15 10 15 

Ala Thr Leu Cys Pro Leu Ala Gin Ala Gin Thr Met Gly. Gly Asp Asp 

25 20 25 30 

Val Lys Val Val Gin Tyr Asn Gin Glu Lys Leu Val Gin Thr Arg Met 

35 40 45 

Ser Val Ala Asp Asn Gly Trp lie Tyr Val Met Thr His Ser Gly Tyr 
50 55 60 

30 Asp Thr Gly Asn Ser Asn Val Lys lie Phe Arg Ser Lys Asp Gin Gly 

65 70 75 80 

Ala Thr Tyr Gin Lys Leu Arg Asp Trp Asp Pro Ser Asp Asp Tyr Gin 

85 90 95 

Phe Gin Asp Phe Asp lie Val Val Thr Gly Lys Asn Glu Ser Asp lie 

35 100 105 110 

Lys lie Trp Ser Val Glu Leu Met Asn Lys Pro Gly Gly Tyr Lys Ser 
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115 

Arg Val Ala Val 
130 

Val Tyr Lys Glu 
145 

Ala Ser Asn Tyr 

Phe Ala Leu Ala 
180 

Ser Phe Val Asp 
195 

Lys Asn Leu Leu 
210 

Asp Leu Ser Leu 

225 

Pro Leu Met Gly 

Asp He Gly Phe 
260 

Trp Ser Gly Pro 
275 

Lys He Gin Met 
290 

Ser Cys His Asn 
305 

Ser Asp Trp Asp 

Glu Lys Gly Lys 
340 

Thr Ala Ser Tyr 
355 

Asn Ala Asn His 
370 

Thr Asn Thr Leu 
385 

Lys Asp Leu Trp 



120 

Phe Ser Arg Asp 
135 

Asp Phe Ser Asn 
150 

Arg Ser Pro Ser 
165 

Phe Ala Tyr Thr 

Tyr Val Phe Ser 

200 

Phe Ser Gin Asp 
215 

Gly Ser Thr Ser 

230 

Val Val Phe Glu 
245 

Leu Ser Asn Phe 

He Lys Val Ser 
280 

Leu Leu Asp Glu 
295 

Phe Met He Thr 
310 

He Arg Tyr Val 
325 

Thr Pro Thr Met 

Gin Ser Glu Thr 
360 

Tyr Leu He Thr 
375 

Lys Tyr Arg Trp 
390 

Ser Asp Thr Phe 



Ala Asn Ala Gin 
140 

Val Gin Leu Tyr 
155 

Ser Leu Asn Asn 
170 

Gly Phe Asn Asn 
185 

Leu Asn Gly Gly 

Gly Glu Lys Lys 
220 

Glu Ser Met Gly 

235 

Met Asn Lys Gin 
250 

Val Asp Asn Asp 
265 

Glu Ser Asp Met 

Asp Asn Asn Thr 
300 

Tyr Ser Asp Tyr 
315 

Tyr Pro Lys Lys 
330 

Asp Asp Leu Val 
345 

Asn Ser Gly Leu 

Tyr Ala Lys Lys 
380 

Ala Asn Tyr Asp 
395 

Thr Tyr Thr Ser 



125 

Asn Ala Lys Leu 

Asp Val Asp He 
160 

Gly Gly Asn Pro 
175 

Thr His Lys He 
190 

Gin Asn Phe Asn 

205 

He Asp Lys Val 

His Asn Ala Trp 

240 

Gly Gly Lys Ser 
255 

Pro Glu Phe Gin 
270 

Ser Phe Ser Pro 

285 

He Asn Gly Glu 

Asp- Sei? Glu Tyr 
320 

Ser Phe Lys Tyr 
335 

Glu Ala Phe Leu 
350 

Gly Tyr Asp Lys 
365 

Glu Glu Asn Gly 

Lys He His Asn 
400 

Ser Ala Asn Ala 
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405 410 415 

Leu Tyr Thr Pro Gin Val Asp lie Asn Pro Thr Lys Gly Leu Val Cys 

420 425 430 

Trp Ser Trp Val Glu Tyr Leu Pro Gly Lys Arg lie Val Trp Ser Asp 

435 440 445 

Thr Gin Trp Thr His Ala Asn Gly Val Glu Asp lie Val Met Gin Glu 

450 455 460 

Gly Ser Met Lys Leu Tyr Pro Asn Pro Ala Gin Glu Tyr Ala Val lie 
465 470 475 480 

Ser Leu Pro Thr Ala Ala Asn Cys Lys Ala Val Val Tyr Asp Met Gin 

485 490 495 

Gly Arg Val Val Ala Glu Ala Ser Phe Ser Gly Asn Glu Tyr Arg Leu 

500 505 510 

Asn Val Gin His Leu Ala Lys Gly Thr Tyr lie Leu Lys Val Val Ser 

515 520 525 

Asp Thr Glu Arg Phe Val Glu Lys Leu lie Val Glu 
530 535 540 

(2) INFORMATION FOR SEQ ID NO: 116 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1 . . .259 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116 

Met Lys Lys Thr Thr Leu Thr Gly Ser lie Cys Ala Leu Leu Leu Phe 
15 10 15 

5 Leu Gly Leu Ser Ala Asn Ala Gin Ser Lys Leu Lys lie Lys Ser lie 
20 25 30 

Glu Ala Ala Thr Thr Phe Ser Ser Ala Thr Ala Gly Asn Gly Phe Gly 

35 40 45 

Gly Asn lie Phe Gly Met Asp Met Ser lie Arg Met Arg Val His His 
10 50 55 60 

Ser lie Leu Pro Glu Gly Leu Asp Phe Ser Val Gly lie His Glu Arg 
65 70 75 80 

Arg Ala His Trp Glu Glu Ala Gly Ser Pro Lys Leu Met Tyr Thr Asn 
85 90 95 

15 Val Pro Ser lie lie Gly lie Val Glu Lys Val lie Val Phe Glu Asp 
100 105 110 

Ala Glu Asp Phe Phe Asp Lys Lys Ala Leu Gly Arg Phe Leu lie Ser 

115 120 125 

Leu Gly lie Ser Tyr Thr Lys His Leu Gly Ala Tyr Trp Gly Trp Thr 
20 130 135 140 

Asn Asp Ala His lie Leu Phe Ser Pro lie Pro Lys Ser Lys Val His 
145 150 155 160 

Tyr Asp Thr Tyr Thr Arg Ala Gly Ser Asp Leu Val Leu Gin Ser Glu 
165 170 . - 175 

25 Asp Val Ala Thr Val Ser Asn Gly Phe Ser Pro Gly lie Gly Leu Lys 
180 185 190 

Ser Ser lie Trp Trp Lys Met Pro lie Lys Ser Lys Tyr Asp Phe Arg 

195 200 205 

Leu Gly Phe Ser Leu Gly Tyr Glu Tyr Leu Asn Leu Leu Tyr Pro Tyr 
30 210 215 220 

Arg Asn Phe Lys Leu Asp Gly Asn Lys Pro Leu Ser Ala Leu Ser Pro 
225 230 235 240 

Arg Met Asn His lie Gly His Val Gly Phe Asn Phe Thr Val Gly Leu 
245 250 255 

35 Trp Thr Asn 
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(2) INFORMATION FOR SEQ ID NO: 117 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 271 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1. . .271 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117 

Met Met Lys Ser Met Arg Ser Val Leu Leu Leu Leu Phe Pro Leu Ser 

15 10 15 

Leu lie Thr Ala Leu Gly Cys Ser Asn Asn Lys Ala Ala Gin Ser Lys 

20 25 30 

Ser Val Ser Phe Asp Ser Ala Tyr Leu Glu Arg Tyr lie Pro Leu Arg 

35 40 45 

Ala Asp He Asp Thr Pro Ser Leu His Val Met He Ser Tyr Val Tyr 

50 55 60 

Pro Ser Gly Asp Asp Met Leu Thr Glu He Phe Asn Gly Leu Leu Phe 

65 70 75 80 

Gly Asp Ser Leu Met Asp Ser Ser Ser Pro Glu Asn Ala Met Glu Gly 

85 90 95 

Tyr Ala Gin Met Leu Gly Glu Asp Tyr Arg Ser Asn Asn Ala Glu Ala 

100 105 HO 

Asn Leu Gin Gly Leu Pro Ser Asp Leu Leu Asp Tyr He Tyr Lys Gin 
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115 

Glu Asn Thr lie 
130 

Asn Thr Tyr Thr 
145 

Phe Ala Asn lie 

lie Phe Lys lie 
180 

Gin Leu Val His 
195 

lie Gly Phe Phe 
210 

lie Asp Asp Lys 

225 

Ala Tyr Ala Arg 

Ala Pro Leu Leu 
260 



120 

Ala Tyr Cys Asp 
135 

Tyr Glu Gly Gly 
150 

Leu Arg Thr Thr 
165 

Asp Tyr Ala Glu 

Asp Phe Gly Lys 
200 

Asn Ala Glu Glu 
215 

Gly Leu Thr Tyr 

230 

Gly Ala Val Tyr 
245 

Arg Asp Asp Ser 



Thr Gly Leu lie 
140 

Ala His Thr Glu 
155 

Gly Lys Val Leu 
170 

Arg Leu Ser Ala 
185 

Thr Thr Pro Ala 

lie Gin Pro Asn 
220 

Cys Phe Asn Glu 

235 

Val Arg Leu Gly 
250 

Pro Leu Lys Arg 
265 



125 

Ser Thr Arg lie 

Asn Thr Val Arg 
160 

Glu Glu Arg Asp 
175 

Leu lie lie Gly 
190 

Glu Leu Asp Ala 
205 

Gly Asn Phe Met 

Tyr Gin lie Ala 

240 

Tyr Asp Val Leu 
255 

Tyr Leu Pro 
270 



(2) INFORMATION FOR SEQ ID NO: 118 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 270 amino acids 

(B) TYPE: amino acid 
( D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



(ix) FEATURE: 

(A) NAME/ KEY : misc feature 



259 



(B) LOCATION 1 . . . 270 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118 

Met Lys Ser Met Arg Ser Val Leu Leu Leu Leu Phe Pro Leu Ser Leu 

1 5 io 15 

He Thr Ala Leu Gly Cys Ser Asn Asn Lys Ala Ala Glu Ser Lys Ser 

20 25 30 

Val Ser Phe Asp Ser Ala Tyr Leu Glu Arg Tyr He Pro Leu Arg Ala 

35 40 45 

Asp He Asp Thr Pro Ser Leu His Val Met He Ser Tyr Val Tyr Pro 

50 55 60 

Ser Gly Asp Asp Met Leu Thr Glu He Phe Asn Gly Leu Leu Phe Gly 
65 70 75 80 

Asp Ser Leu Met Asp Ser Ser Ser Pro Glu Asn Ala Met Glu Gly Tyr 

85 90 95 

Ala Gin Met Leu Gly Glu Asp Tyr Arg Ser Asn Asn Ala Glu Ala Asn 

100 105 HO 

Leu Gin Gly Leu Pro Ser Asp Leu Leu Asp Tyr He Tyr Lys Gin Glu 

115 120 125 

Asn Thr He Ala Tyr Cys Asp Thr Gly Leu He Ser Thr Arg He Asn 

130 135 140 

Thr Tyr Thr Tyr Glu Gly Gly Ala His Thr Glu Asn Thr Val Arg Phe 
145 150 155 - ~ 160 

Ala Asn He Leu Arg Thr Thr Gly Lys Val Leu Glu Glu Arg Asp He 

165 170 175 

Phe Lys He Asp Tyr Ala Glu Arg Leu Ser Ala Leu He He Gly Gin 

180 185 190 

Leu Val His Asp Phe Gly Lys Thr Thr Pro Ala Glu Leu Asp Ala He 

195 200 205 

Gly Phe Phe Asn Ala Glu Glu He Gin Pro Asn Gly Asn Phe Met He 

210 215 220 

Asp Asp Lys Gly Leu Thr Tyr Cys Phe Asn Glu Tyr Gin He Ala Ala 
225 230 235 240 

Tyr Ala Arg Gly Ala Val Tyr Val Arg Leu Gly Tyr Asp Val Leu Ala 
245 250 255 
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Pro Leu Leu Arg Asp Asp Ser Pro Leu Lys Arg Tyr Leu Pro 
260 265 270 

(2) INFORMATION FOR SEQ ID NO: 119 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 00 amino acids 

(B) TYPE: amino acid 
( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

( iii ) HYPOTHETICAL : YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1 . . . 400 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119 

Met Val Met Lys Leu lie Lys Arg Ser Leu Leu Leu Leu Gly Ala Val 

15 10 15 

Leu Leu lie Thr Leu Pro Ala Tyr Ser Gin Asn Asp Asp lie Phe Glu 

20 25 30 

Asp Asp lie Tyr Thr Ser Arg Lys Glu lie Arg Lys Gin Asn Gin Val 

35 40 45 

Lys Asp Trp Gin Asn Gin Glu Asp Gly Tyr Gly Asp Asp Thr Glu Tyr 

50 55 60 

Thr Val Ala Ser Asp Arg Asp lie Asp Ala Tyr Asn Arg Arg Asp Gly 

65 70 75 80 

Gin Ser Tyr Asp Gly Lys Lys Leu Ser Lys Asp Lys Lys Arg Asp Ser 

85 90 95 

Thr Arg Ser Ser Val Pro Gly Arg Tyr Ser Arg Arg Leu Ala Arg Phe 
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100 

Tyr Lys Pro Asn Thr 
115 

Thr Asp Asp Gly Glu 
5 130 

Ala Ser Ser Val Asn 
145 

Pro Tyr Thr Ser Trp 
165 

10 Trp Asn Tyr Pro Trp 
180 

Tyr Tyr Pro Gly Tyr 
195 

Tyr Asn Pro Tyr Gly 
15 210 

Gly Ser Tyr Tyr Gly 
225 

His His Tyr Pro Lys 
245 

20 Tyr Tyr Ser Tyr Gly 
260 

Ala Lys Leu Gly Thr 
275 

Gin Lys Asn Lys Phe 
25 290 

Gin Asn Val Lys Ser 
305 

lie Glu Thr Val Thr 
325 

30 Phe Gin Gin Asn Gin 
340 

Arg Ser Glu Arg Gin 
355 

Ser Arg Ser Asn Ser 
35 370 

Ser Gly Ser Met Ser 



105 

lie Val lie Ser Gly 
120 

Tyr Phe Val Tyr Gly 
135 

lie Tyr lie Asn Ser 
150 

Tyr Pro Ser Phe Ser 
170 

Phe Tyr Tyr Gly Ser 
185 

Asn Trp Tyr Trp Ser 
200 

lie Gly Met Gly Trp 
215 

Trp Gly Gly Tyr Pro 
230 

Lys Thr Tyr Ser Asn 
250 

Arg Pro Asn Arg lie 
265 

Gly Arg Tyr Asp Arg 
280 

Gly Leu Gin Ser Asn 
295 

Gly Arg Thr Gly Arg 
310 

Pro Asn Asn Gly Gin 
330 

Ser Gly Asn Asp Arg 
345 

Gly Glu Asn Asn Asp 
360 

Asn Gly Gly Phe Ser 
375 

Gly Gly Gly Gly Arg 



110 

Ala Asp Asn Val Tyr Val 
125 

Asp Glu Tyr Tyr Asp Asp 
140 

Pro Trp Cys Asp Pro Phe 
155 160 
Gly Trp Tyr Asn Tyr Thr 
175 

His lie Gly Trp Gly Gly 
190 

Tyr Tyr Tyr Asp Pro Phe 
205 

Gly Tyr Pro Tyr Gly Trp 
220 

Gly Val lie His His Tyr 
235 240 
Gly Gin His Ser Gly Ala 
255 

Lys Gly Gly Thr Ser Gly 
270 

lie Gin Asn Ser Ser Ser 
285 

Lys Pro Asn. Asn Asn Leu 
300 

Ala Asn Arg Asp Arg Asn 
315 320 
Lys Gin Asn Arg Pro Val 
335 

Pro Thr Gly Arg Asn lie 
350 

Arg Thr Phe Ser Thr Pro 
365 

Thr Pro Ser Arg Ser Ser 
380 

Ser Gly Arg Gly Arg Asn 
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385 390 395 400 

(2) INFORMATION FOR SEQ ID NO: 120 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 398 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : mis c_f eature 

(B) LOCATION 1. . .398 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120 



Met Lys Leu lie Lys Arg Ser Leu Leu Leu Leu Gly Ala Val Leu Leu 
1 5 10 - 15 

25 lie Thr Leu Pro Ala Tyr Ser Gin Asn Asp Asp lie Phe Glu Asp Asp 
20 25 30 

lie Tyr Thr Ser Arg Lys Glu lie Arg Lys Gin Asn Gin Val Lys Asp 

35 40 45 

Trp Gin Asn Gin Glu Asp Gly Tyr Gly Asp Asp Thr Glu Tyr Thr Val 
30 50 55 60 

Ala Ser Asp Arg Asp lie Asp Ala Tyr Asn Arg Arg Asp Gly Gin Ser 
65 70 75 80 

Tyr Asp Gly Lys Lys Leu Ser Lys Asp Lys Lys Arg Asp Ser Thr Arg 
85 90 95 

35 Ser Ser Val Pro Gly Arg Tyr Ser Arg Arg Leu Ala Arg Phe Tyr Lys 
100 105 110 
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Pro Asn Thr He Val He Ser Gly Ala Asp Asn Val Tyr Val Thr Asp 
115 120 125 



Asp Gly Glu Tyr Phe Val Tyr Gly Asp Glu Tyr Tyr Asp Asp Ala Ser 

130 135 140 

Ser Val Asn He Tyr He Asn Ser Pro Trp Cys Asp Pro Phe Pro Tyr 

14S 

Thr Ser Trp Tyr Pro Ser Phe Ser Gly Trp Tyr Asn Tyr Thr Trp Asn 

165 170 " 5 

Tyr Pro Trp Phe Tyr Tyr Gly Ser His He Gly Trp Gly Gly Tyr Tyr 

180 185 I 90 

Pro Gly Tyr Asn Trp Tyr Trp Ser Tyr Tyr Tyr Asp Pro Phe Tyr Asn 



195 



200 205 



Pro Tyr Gly He Gly Met Gly Trp Gly Tyr Pro Tyr Gly Trp Gly Ser 

210 215 220 

Tyr Tyr Gly Trp Gly Gly Tyr Pro Gly Val He His His Tyr His His 



240 

225 

rr 



230 235 



Tyr Pro Lys Lys Thr Tyr Ser Asn Gly Gin His Ser Gly Ala Tyr Ty 

245 250 255 

Ser Tyr Gly Arg Pro Asn Arg lie Lys Gly Gly Thr Ser Gly Ala Lys 

260 265 270 

Leu Gly Thr Gly Arg Tyr Asp Arg He Gin Asn Ser Ser Ser Gin Lys 
275 

Asn Lys 



280 285 



Phe Gly Leu Gin Ser Asn Lys Pro Asn Asn Asn Leu Gin Asn 



290 295 300 



Val Lys Ser Gly Arg Thr Gly Arg Ala Asn Arg Asp Arg Asn He Glu 



305 



310 



315 



320 



Thr Val Thr Pro Asn Asn Gly Gin Lys Gin Asn Arg Pro Val Phe Gin 



325 



330 335 



Gl 



n Asn Gin Ser Gly Asn Asp Arg Pro Thr Gly Arg Asn He Arg Ser 



340 



345 350 



Glu Arg Gin Gly Glu Asn Asn Asp Arg Thr Phe Ser Thr Pro Ser Arg 

355 3 60 365 

Ser Asn Ser Asn Gly Gly Phe Ser Thr Pro Ser Arg Ser Ser Ser Gly 

370 575 3 80 

Ser Met Ser Gly Gly Gly Gly Arg Ser Gly Arg Gly Arg Asn 
385 3 90 395 
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(2) INFORMATION FOR SEQ ID NO: 121 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

( iii ) HYPOTHETICAL : YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE : 

(A) NAME/ KEY : mis c_f eature 

(B) LOCATION 1. . .207 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121 

Met Lys Arg Met Leu Leu Leu Leu Val Val Leu Leu Tyr Gly lie Ala 

15 10 15 

Gly Arg Leu Ala Ala Gin Asp Val lie Arg Pro Trp Ser. Leu Gin Val 

20 25 30 

Gly Ala Gly Tyr Ser Asp Thr Glu Asn lie Pro Gly Gly Phe Thr Tyr 

35 40 45 

Gly Phe Tyr Leu Gly Lys Arg Met Gly Ser Phe Leu Glu Val Gly Leu 

50 55 60 

Ser Met Tyr Asn Ser Thr Arg Gin Thr Ala Asn Asn Ala Asp Ser Phe 
65 70 75 80 

Ala Ser Asn Glu Gly Asp Gly Ser Phe Gin Val Asn Met Ser Ser Pro 

85 90 95 

Asn Glu Lys Trp Ser Phe Phe Asp Ala Gly Ser Ala Asn Cys Tyr Met 

100 105 110 

lie Val Val Gly Val Asn Pro Leu His Leu Phe Trp Gin Asn Ser Arg 
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115 120 125 

His Asn Leu Phe Leu Ala Val Gin Ala Gly Leu Ser Asn Lys His Asn 

130 135 140 

lie His Phe He Tyr Gly Asp Lys Gly Ala Lys Val Ser He Tyr Thr 
5 145 150 155 160 

Asn Ser Asn Thr Tyr He Gly Tyr Gly Ala Arg Val Ala Tyr Glu Tyr 

165 170 175 

Gin He His Lys Asn Val Gly Ala Gly Ala Ala Val Met Tyr Asp His 
180 185 190 

10 Gly Asn Lys Met Leu Thr Ala Met Ala Thr Leu Ser Thr His Phe 
195 200 205 

(2) INFORMATION FOR SEQ ID NO: 122 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 771 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1 . . .771 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122 



Met Leu Thr He Arg Asn Phe Leu Leu Phe Cys Cys Leu Ser Leu He 
15 10 15 

35 Ala Phe Ala Ala Asp Ala Gin Ser Ser Val Ser Ser Gly Arg Arg Leu 

20 25 30 
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Thr Glu Tyr Val Asn Pro Phe He Gly Thr Ala Asn Tyr Gly Thr Thr 

35 40 45 

Asn Pro Gly Ala Val Leu Pro Asn Gly Leu Met Ser Val Thr Pro Phe 

50 55 60 

Asn Val Ser Gly Ser Thr Glu Asn Arg Phe Asp Lys Asp Ser Arg Trp 
65 70 75 80 

Trp Ser Ala Pro Tyr Ser Ala Asp Asn Ser Tyr Cys He Gly Phe Ser 

85 90 95 

His Val Asn Leu Ser Gly Val Gly Cys Pro Glu Leu Ser Gly lie Leu 

100 105 110 

Leu Met Ala Thr Ser Gly Thr Phe Asp Pro Asp Tyr Cys Cys Tyr Gly 

115 120 125 

Ser Ser Leu Ser Arg Glu Tyr Ala Arg Pro Gly Glu Tyr Lys Ala Val 

130 135 140 

Leu Asp Lys Tyr Gly He Asp Ala Ala Val Thr Val Thr Glu Arg Thr 
145 150 155 160 

Ala Leu Thr Glu Phe Ala Phe Pro Glu Gly Glu Gly His He Leu Leu 

165 170 175 

Asn Leu Gly Gin Ala Leu Ser Asn Glu Ser Gly Ala Ser Val Arg Phe 

180 185 190 

Leu Asn Asp Ser Thr Val Val Gly Ser Arg Leu Met Gly Thr Phe Cys 

195 200 205 

Tyr Asn Pro Gin Ala Val Phe Arg Gin Tyr Phe Val Leu Gin Val Ser 

210 215 220 

Arg Arg Pro He Ser Ala Gly Tyr Trp Lys Lys Gin Pro Pro Met Thr 
225 230 235 240 

Val Glu Ala Gin Trp Asp Ser Thr Ala Gly Lys Tyr Lys Gin Tyr Asp 

245 250 255 

Gly Tyr Lys Arg Glu Met Ser Gly Asp Asp He Gly Val Arg Phe Ser 

260 265 270 

Phe Asn Cys Asp Gin Gly Glu Lys He Tyr Val Arg Ser Ala Val Ser 

275 280 285 

Phe Val Ser Glu Ala Asn Ala Leu Tyr Asn Leu Glu Ala Glu Gin Glu 

290 295 300 

Glu Val Phe Lys Ser Val Gly Gly Asn Pro Ala Lys Ala Phe Ser Ala 
305 310 315 320 
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He Arg Ser Arg Ala He Glu Arg Trp Glu Glu Ala Leu Gly Thr Val 

325 330 335 

Glu Val Glu Gly Gly Thr Pro Asp Glu Lys Thr He Phe Tyr Thr Ala 

340 345 350 

Leu Tyr His Leu Leu He His Pro Asn He Leu Gin Asp Ala Asn Gly 

355 360 365 

Glu Tyr Pro Met Met Gly Ser Gly Lys Thr Gly Asn Thr Ala His Asp 

370 375 380 

Arg Tyr Thr Val Phe Ser Leu Trp Asp Thr Tyr Arg Asn Val His Pro 
385 390 395 400 

Leu Leu Cys Leu Leu Tyr Pro Glu Lys Gin Leu Asp Met Val Arg Thr 

405 410 415 

Leu He Asp Met Tyr Arg Glu Ser Gly Trp Leu Pro Arg Trp Glu Leu 

420 425 430 

Tyr Gly Gin Glu Thr Leu Thr Met Glu Gly Asp Pro Ser Leu He Val 

435 440 445 

He Asn Asp Thr Trp Gin Arg Gly Leu Arg Ala Phe Asp Thr Ala Thr 

450 455 460 

Ala Tyr Glu Ala Met Lys Lys Asn Ala Ser Ser Ala Gly Ala Thr His 
465 470 475 480 

Pro He Arg Pro Asp Asn Asp Asp Tyr Leu Thr Leu Gly Phe Val Pro 

485 490 495 

Leu Arg Glu Gin Tyr Asp Asn Ser Val Ser His Ala Leu Glu Tyr Tyr 

500 505 • 510 

Leu Ala Asp Trp Asn Leu Ser Arg Phe Ala His Ala Leu Gly His Lys 

515 520 525 

Glu Asp Ala Ala Leu Phe Gly Lys Arg Ser Leu Gly Tyr Arg His Tyr 

530 535 540 

Tyr Asn Lys Glu Tyr Gly Met Leu Cys Pro Leu Leu Pro Asp Gly Ser 
545 550 555 560 

Phe Leu Thr Pro Phe Asp Pro Lys Gin Gly Glu Asn Phe Glu Pro Asn 

565 570 575 

Pro Gly Phe His Glu Gly Ser Ala Tyr Asn Tyr Ala Phe Phe Val Pro 

580 585 590 

His Asp He Gin Gly Leu Ala Arg Leu Met Gly Gly Ala Lys Val Phe 
595 600 605 
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Ser Glu Arg Leu Gin Lys Val Phe Asp Glu Gly Tyr Tyr Asp Pro Thr 

610 615 620 

Asn Glu Pro Asp lie Ala Tyr Pro Tyr Leu Phe Ser Tyr Phe Pro Lys 
625 630 635 640 

5 Glu Ala Trp Arg Thr Gin Lys Leu Thr Arg Glu Leu lie Asp Lys His 

645 650 655 

Phe Cys Asn Ala Pro Asn Gly Leu Pro Gly Asn Asp Asp Ala Gly Thr 

660 665 670 

Met Ser Ala Trp Leu Val Tyr Ser Met Leu Gly Phe Tyr Pro Asp Cys 
10 675 680 685 

Pro Gly Ser Pro Thr Tyr Thr Leu Thr Ser Pro Val Phe Pro Arg Val 

690 695 700 

Arg lie Arg Leu Asn Pro Gin Tyr Tyr Pro Gin Gly Glu Leu lie lie 
705 710 715 720 

15 Thr Thr Asn Thr Glu Asn Gin Pro Thr Asp Ser lie Tyr lie His Thr 

725 730 735 

Val Ser Leu Gly Asn Lys Thr Leu Pro His Gly Thr Arg His lie Ser 

740 745 750 

His Ala Asp Leu Val Arg Cys Gly His Leu Arg Tyr Glu Leu Ser Asn 
20 755 760 765 

Arg Pro Arg 
770 

(2) INFORMATION FOR SEQ ID NO: 123 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 951 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: protein 



( iii ) HYPOTHETICAL : YES 



35 (vi) 



ORIGINAL SOURCE: 
(A) ORGANISM: Porphyromonas gingival is 
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(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1 . . . 951 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123 

Met Arg Val Ser Asp Leu Cys Ser Arg Leu Ser Trp Leu Leu Pro Val 

1 5 10 15 

He Leu Val Gly Leu Leu Cys Ala Thr Leu Val Ala Ala Glu Arg Pro 

20 25 30 

Met Ala Gly Ala Val Gly Leu His His Arg Arg His Ala Ala Leu Ser 

35 40 45 

Asp Ser Thr Ala Lys Asp Thr Val Pro Leu Ala Lys Pro He Pro Asp 

50 55 60 

Ser Ala Phe Arg Asp Ser Leu Pro Ala Asp Ser Thr Gly Ser Met Arg 
65 70 75 80 

Gin Asp Ser Val Tyr Asp Asp Glu Phe Glu Leu Glu Asp He Val Glu 

85 90 95 

Tyr Glu Ala Ala Asp Ser He Val Leu Leu Gly Gin Asn Arg Ala Tyr 

100 105 HO 

Leu Phe Gly Lys Ser Tyr Val Ser Tyr Gin Lys Ser Arg Leu Glu Ala 

115 120 125 

Asn Phe Met Tyr Leu Asn Thr Asp Ser Ser Thr Val Tyr Thr Arg Tyr 

130 135 140 

Val Leu Asp Thr Ala Gly Tyr Pro Met Ala Phe Pro Val Phe Lys Asp 
145 150 155 160 

Gly Glu Gin Ser Phe Glu Ala Lys Asn Phe Thr Tyr Asn Phe Arg Thr 

165 170 175 

Glu Lys Gly He He Ser Gly Val He Thr Gin Gin Gly Glu Gly Tyr 

180 185 190 

Leu Thr Ala Gly Lys Thr Lys Lys Met Pro Asp Asn He Met Phe Met 

195 200 205 

Gin Gly Gly Arg Tyr Thr Thr Cys Asp Asn His Asp His Pro His Phe 

210 215 220 

Tyr He Asn Leu Ser Lys Ala Lys Val His Pro Glu Lys Asp He Val 
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225 230 235 240 

Thr Gly Pro Val Asn Leu Val lie Ala Asp Met Pro Leu Pro lie Gly 

245 250 255 

Leu Pro Phe Gly Tyr Phe Pro Phe Ser Asn Lys Tyr Ser Ser Gly He 

260 265 270 

Leu Met Pro Thr Tyr Gly Glu Asp Asn Arg Tyr Gly Phe Tyr Leu Arg 

275 280 285 

Asn Gly Gly Tyr Tyr Phe Ala Phe Ser Asp Tyr He Asp Leu Ala Leu 

290 295 300 

Arg Gly Glu He Phe Ser Lys Gly Ser Trp Gly He Ser Ala Gin Ser 
305 310 315 320 

Lys Tyr Lys Lys Arg Tyr Lys Tyr Asn Gly Ser Phe Glu Ala Asn Tyr 

325 330 335 

Leu Val Ser Lys Ser Gly Asp Lys Tyr Val Pro Gly Asp Tyr Ser Lys 

340 345 350 

Thr Thr Ser Leu Asn He Arg Trp Thr His Ser Gin Asp Pro Lys Ala 

355 360 365 

Asn Pro Leu Gin Thr Leu Ser Ala Asn Val Asn Phe Ala Thr Gly Ser 

370 375 380 

Tyr Phe Gin Asn Ser Leu Asn Thr Thr Tyr Asp Val Asn Ala Arg Thr 
385 390 395 400 

Ala Thr Thr Arg Ser Ser Ala Val Ser Tyr Ser Arg Lys Phe Pro Gly 

405 410 415 

Thr Pro Phe Ser He Thr Gly Ser Met Asp He Ser Gin Asn Met Arg 

420 425 430 

Asp Thr Thr Val Ser Leu Thr Leu Pro Asn Leu Ser He Asn Met Ser 

435 440 445 

Thr Arg Tyr Pro Phe Lys Arg Lys Thr Arg Val Gly Pro Glu Arg Trp 

450 455 460 

Tyr Glu Lys Leu Ser Val Gly Tyr Ser Gly Gin Leu Arg Asn Ser He 
465 470 475 480 

Leu Thr Lys Glu Lys Asp Leu Leu Gin Ser Asn Leu Val Arg Asp Trp 

485 490 495 

Lys Asn Gly Met Arg His Ser Val Pro He Ser Leu Thr Val Pro Leu 

500 505 510 

Leu Asp Tyr He Asn Leu Thr Met Gly Val Asn Tyr Asn Glu Trp Trp 
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515 

Tyr Thr Lys Gly 
530 

Leu Pro Ser Asp 
545 

Leu Ser Ala Gly 

Lys Pro Phe Ser 
580 

Thr Pro Thr Val 
595 

Tyr Gly Phe Trp 
610 

His Thr Leu Leu 
625 

Ser Met Gly Asn 

Glu Ala Lys lie 
660 

Ser Leu lie Asp 
675 

Asp Ser lie Arg 
690 

Ser Lys Ser Phe 
705 

Thr Lys Tyr Tyr 

Asn Asp Leu Arg 
740 

Thr Gly Thr Ser 
755 

Leu lie Ala Leu 
770 

Asn Thr Gly Ala 
785 

Glu Gly Gly Arg 



520 

lie Arg Lys Ser 
535 

Thr Thr Tyr Lys 
550 

Leu Ser Thr Thr 
565 

Phe Gly Gly Asn 

Ser Phe Ser Tyr 
600 

Glu Leu Leu Glu 
615 

Tyr Ser Pro Tyr 
630 

Ala Gly Ser Val 
645 

Lys Ser Lys Ser 

Gin Phe Thr Trp 
680 

Trp Ser Asn lie 
695 

Thr Leu Arg Leu 
710 

Glu Gly Glu Asp 
725 

lie Phe Asn Gly 

Phe Ser Tyr Thr 
760 

Phe Ser Gly Lys 
775 

Thr Pro His Glu 
790 

Pro Gin Asn Glu 



Trp Asn Glu Asp 
540 

Phe Arg Arg Leu 
555 

Leu Tyr Gly Met 
570 

Leu lie Met lie 
585 

Met Pro Asp Phe 

His Thr Asp Gin 
620 

Phe Glu Gin lie 
635 

Asn Phe Ser Phe 
650 

Asp Ser Thr Gly 
665 

Ser Thr Ser Tyr 

Ser Ala Ser Leu 
700 

Ser Gly Leu Phe 
715 

Gly Lys lie lie 
730 

Lys Gly Leu Ala 
745 

Leu Asn Lys Glu 

Lys Glu Arg Arg 
780 

Gly Asp Asp Ala 
795 

Ser Gly Gly Ser 



525 

Lys Lys Thr Phe 

Tyr Asp Tyr Ser 
560 

Phe Lys Pro Trp 
575 

Arg His Arg Phe 
590 

Thr Lys Arg Arg 
605 

Asn Gly Lys Leu 

Phe Gly Ala Pro 
640 

Asp Asn Asn Leu 
655 

lie Lys Lys lie 
670 

Asn Met Phe Ala 
685 

Ala Leu Arg Leu 

Asp- Pro Tyr Leu 
720 

Pro Tyr Lys Ser 
735 

Arg Leu lie Ser 
750 

Ser Leu Ser Gly 
765 

Asp Glu Lys Lys 

Ala Asp lie Leu 
800 

Leu Leu Glu Arg 
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805 810 815 

Asn Arg Gin Gly Gly Ala Val Asp Gin Asp Gly Tyr Phe Ala Tyr Ser 

820 825 830 

lie Pro Trp Ser Leu Ser Phe Asp Tyr Ser Trp Asn He Ala Thr Asp 

835 840 845 

Tyr Asn Arg Tyr Asn Val Asn Lys Met Glu His Tyr Tyr Arg Val Thr 

850 855 860 

Gin Asn Leu Ser Phe Arg Gly Asn He Gin Pro Thr Pro Asn Trp Ser 
865 870 875 880 

Phe Gly Phe Asn Ala Asn Tyr Asn Phe Asp Leu Lys Lys He Thr Ser 

885 890 895 

Leu Thr Cys Asn Val Thr Arg Asp Met His Cys Trp Ala He Ser Ala 

900 905 910 

Ser Phe He Pro He Gly Ala Tyr Lys Ser Tyr Asn Phe Val He Ser 

915 920 925 

Val Lys Ser Ser Leu Leu Gin Asp Leu Lys Tyr Gin Gin Ser Asn Arg 

930 935 940 

Pro He Thr Asn Thr Trp Tyr 
945 950 

(2) INFORMATION FOR SEQ ID NO: 124 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 581 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

( iii ) HYPOTHETICAL : YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : misc feature 
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(B) LOCATION 1 ... 581 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124 

Met He Arg Lys Leu He Leu Leu Leu Ala Leu Met Pro Val Ala Ser 

1 5 10 15 

Val Ala Phe Ala Val Pro Thr Asp Ser Thr Glu Ser Lys Asp Asn Arg 

20 25 30 

He Leu Thr Ser Met Gin Ser Ser Ser Leu Asn Arg Asp Asp Ala Pro 

35 40 45 

Asp Lys Trp Gin Pro Met His Ala Asn Phe Ser He Gin Ser Asp Met 

50 55 60 

Leu Leu Ser Thr Ala Gin Lys Ser Lys Asn Thr Trp Phe Gly Asn Ser 
65 70 75 80 

Tyr He Met Gly He He Lys Asn Asn Tyr Leu Glu Phe Gly Ala Arg 

85 90 95 

Phe Glu Asp Leu Tyr Lys Pro Leu Pro Gly His Glu Pro Glu Met Unk 

100 105 HO 

Arg Gly Val Pro His Met Tyr Val Lys Gly Ser Tyr His Trp Ala Glu 

115 120 125 

Leu Thr Met Gly Asp Phe Tyr Asp Gin Phe Gly Ser Gly Met Val Phe 

130 135 140 

Arg Thr Tyr Glu Glu Arg Asn Leu Gly He Asp Asn Ala Val Arg Gly 
145 150 155 - • 160 

Gly Arg He Val Leu Thr Pro Phe Asp Gly Val Arg Val Lys Gly He 

165 170 175 

Ala Gly Gin Gin Arg Asn Tyr Phe Asp Arg Thr Gly Lys Val Phe Asn 

180 185 190 

Ser Gly Arg Gly Tyr Leu Leu Gly Ser Asp Leu Glu Leu Asn Val Glu 

195 200 205 

Arg Trp Ser Ser Ala Met Arg Asp Asn Asp Tyr His Leu Ala He Gly 

210 215 220 

Gly Ser Phe Val Ser Lys His Glu Ala Asp Glu Asp He Phe Val Gly 
225 230 235 240 

Val Gly Glu Asp Arg Lys Arg Leu Asn Leu Pro Leu Asn Val Pro He 
245 250 255 
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Met Gly Leu 

Glu Tyr Gly 
275 

5 Tyr His Asp 
290 
Gly Met Ser 
305 

Arg Ser Lys 

10 

Pro Ala Phe 

Tyr Ala Thr 
355 

15 Tyr Asn Phe 
370 
Leu Arg lie 
385 

Lys Glu Asn 

20 

Phe Phe Gly 

Thr Lys Lys 
435 

25 Gin lie Tyr 
450 

Glu Lys lie 
465 

Ser Asn Lys 

30 

Gin Asp Gin 

Pro Ser Leu 
515 

35 Lys Lys His 
530 



Arg Thr Asn Phe 

260 

Tyr Lys Tyr Asn 

Gly Gin Ala Ala 
295 

lie Leu Leu Gin 
310 

Arg Ser Ala Gin 
325 

Thr Gin Ala His 
340 

Gin Pro Gin Gly 

Ala Arg Arg Thr 
375 

Asn Val Ser His 
390 

Pro Asp Glu Leu 
405 

Met Gly Asp Leu 
420 

Val Ser Pro Gly 

Asn Asn Lys Val 
455 

Tyr Ala Asn lie 
470 

Val Ala Leu Arg 
485 

Gly Asp Trp lie 
500 

Met Leu Ser Leu 

Tyr Val Met Gly 
535 



Gin Lys Gly Gly Leu 

265 

Asp Pro Ser Ala Asp 
280 

Leu Leu Ser Ala Ser 
300 

Ala Lys Arg Cys Glu 
315 

Leu Thr Pro Leu Met 
330 

Thr Tyr Thr Leu Ala 
345 

Glu Trp Ala Phe Gin 
360 

Ala Leu Gly Gly Arg 
380 

Val Arg Gly Leu Asp 
395 

lie Gly Thr Asp Gly 
410 

Tyr Tyr Ser Asp lie 
425 

Phe Asn Phe Thr Leu 
440 

Leu His Gly Ala Ala 
460 

Phe Val Tyr Asp Gly 
475 

Thr Glu Leu Gin Tyr 
490 

Tyr Gly Met Ala Glu 
505 

Ser Glu Gin Tyr Asn 
520 

Ser Val Thr Tyr Thr 
540 



Ala Leu Tyr Ala 
270 

Asn Asp Tyr lie 
285 

Tyr Ser Lys Lys 

Asn Phe Ala Phe 

320 

lie Asn Tyr Met 

335 

Ala lie Tyr Pro 
350 

Gly Glu Leu Arg 
365 

Tyr Gly Thr Gly 

Lys Lys Met Leu 
400 

Tyr Thr Val Ser 
415 

Asp Val Glu lie 
430 

Thr Tyr Leu Asn 
44S 

Gly Glu Lys Pro 

Lys Tyr Lys Leu 
480 

Leu His Thr Lys 
495 

Leu Ser lie Leu 
510 

lie Gly Glu Thr 
525 

His Gly Ala His 
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Arg Val Ala Phe Ser Ala Gly Lys Thr Arg Ala Gly Met Asn Cys Ser 
545 550 555 560 

Gly Gly Val Cys Arg Val Val Pro Glu Thr Gin Gly Phe Tyr Leu Ser 

565 570 575 

Tyr Ser Thr Asn Leu 
580 

(2) INFORMATION FOR SEQ ID NO: 125 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 940 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: miscj eature 

(B) LOCATION 1...940 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125 

Met Asn Lys Phe Tyr Lys Ser Leu Leu Gin Ser Gly Leu Ala Ala Phe 

15 10 15 

Val Ser Met Ala Thr Ala Leu Thr Ala Ser Ala Gin lie Ser Phe Gly 

20 25 30 

Gly Glu Pro Leu Ser Phe Ser Ser Arg Ser Ala Gly Thr His Ser Phe 

35 40 45 

Asp Asp Ala Met Thr lie Arg Leu Thr Pro Asp Phe Asn Pro Glu Asp 

50 55 60 

Leu lie Ala Gin Ser Arg Trp Gin Ser Gin Arg Asp Gly Arg Pro Val 
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65 

Arg lie Gly 
Ser His lie 

5 

Lys Leu Glu 
115 

lie Pro Glu 
130 

10 Val Leu Gly 
145 

Ala Thr Glu 
Ser Arg Gly 

15 

lie Phe Asp 
195 

lie Gly Glu 
210 

20 Pro Glu Gly 

225 

lie Met Val 
Asn Asn Thr 

25 

Cys Ala Ser 
275 

Lys Trp lie 
290 

30 Thr Leu Ala 
305 

Ala Phe Leu 
Asn Asp Glu 

35 

Ser Thr Pro 



70 

Gin Val lie Pro 
85 

Ser Ser lie Gly 
100 

Gly Ala Lys Ala 

Gly Gly Arg Leu 
135 

Ala Tyr Thr Asn 
150 

Pro Val Pro Gly 
165 

Gly Thr Leu Pro 

180 

Lys Val Gly Gly 

Asp Asp Ser Asp 
215 

Ala Asp Trp Gin 
230 

Lys Gly Gin Tyr 
245 

Lys Gly Asp Phe 
260 

lie Thr Thr Asn 

Phe Thr Phe His 
295 

lie Phe Arg Gly 
310 

Pro lie Lys Gly 
325 

Val Pro Leu Arg 
340 

Asp lie Pro Ser 



75 

Val Asp Val Asp Phe 
90 

Asp Val Asp Val Tyr 
105 

lie Thr Leu Tyr Tyr 
120 

Tyr lie Tyr Thr Pro 
140 

Ala Thr His Arg Arg 
155 

Ser Glu Leu lie Met 
170 

Asp lie Lys lie Ser 
185 

Arg Pro Val Thr Asp 
200 

Ser Asp Cys Glu lie 
220 

Ala Glu Lys Asn Gly 
235 

lie Ser Met Cys Ser 
250 

Thr Pro Leu lie lie 
265 

Phe Gly Val Thr Gin 
280 

Tyr Glu Lys Arg Gly 
300 

Asn Ser He He Gly 
315 

Lys Ser Asp Gly Leu 
330 

Tyr Arg Val Tyr Tyr 
345 

Ser Gly Ala Gly He 



80 

Ala Ser Lys Ala 
95 

Arg Leu Gin Phe 
110 

Asp Ala Phe Asn 
125 

Asp His Glu He 

Asn Gly Ala Phe 
160 

Asp Tyr Glu Val 
175 

Gly Ala Gly Tyr 
190 

Asn His Tyr Gly 
205 

Asn He Asn Cys 

Val Val Gin Met 
240 

Gly Asn Leu Leu 
255 

Ser Ala Gly His 
270 

Ser Glu Leu Asp 
285 

Cys Ser Asn Gly 

Ala Ser Met Lys 
320 

Leu Leu Gin Leu 
335 

Asn Gly Trp Asp 
350 

His His Pro Ala 
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355 360 365 

Gly Asp Ala Met Lys He Ser He Leu Lys Lys Thr Pro Ala Leu Asn 

370 375 380 

Thr Trp He Ser Ser Ser Gly Ser Gly Gly Thr Asp Asp His Phe Tyr 

•son 395 400 

385 390 

Phe Lys Tyr Asp Gin Gly Gly Thr Glu Gly Gly Ser Ser Gly Ser Ser 

405 410 415 

Leu Phe Asn Gin Asn Lys His Val Val Gly Thr Leu Thr Gly Gly Ala 

420 425 430 

Gly Asn Cys Gly Gly Thr Glu Phe Tyr Gly Arg Leu Asn Ser His Trp 

435 440 445 

Asn Glu Tyr Ala Ser Asp Gly Asn Thr Ser Arg Met Asp He Tyr Leu 

450 455 460 

Asp Pro Gin Asn Asn Gly Gin Thr Thr He Leu Asn Gly Thr Tyr Arg 

Ann 475 480 

465 470 HZ 

Asp Gly Tyr Lys Pro Leu Pro Ser Val Pro Arg Leu Leu Leu Gin Ser 

485 490 495 

Thr Gly Asp Gin Val Glu Leu Asn Trp Thr Ala Val Pro Ala Asp Gin 

500 505 510 

Tyr Pro Ser Ser Tyr Gin Val Glu Tyr His He Phe Arg Asn Gly Lys 

515 520 525 

Glu He Ala Thr Thr Lys Glu Leu Ser Tyr Ser Asp Ala He Asp Glu 

530 535 540 

Ser He He Gly Ser Gly He He Arg Tyr Glu Val Ser Aim Arg Phe 
545 550 555 560 

He Tyr Pro Ser Pro Leu Asp Gly Val Glu Ser Tyr Lys Asp Thr Asp 

565 570 575 

Lys Thr Ser Ala Asp Leu Ala He Gly Asp He Gin Thr Lys Leu Lys 

580 585 590 

Pro Asp Val Thr Pro Leu Pro Gly Gly Gly Val Ser Leu Ser Trp Lys 

595 600 605 

Val Pro Phe Leu Ser Gin Leu Val Ser Arg Phe Gly Glu Ser Pro Asn 

610 615 620 

Pro Val Phe Lys Thr Phe Glu Val Pro Tyr Val Ser Ala Ala Ala Ala 
625 630 635 640 

Gin Thr Pro Asn Pro Pro Val Gly Val Val He Ala Asp Lys Phe Met 
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645 650 655 

Ala Gly Thr Tyr Pro Glu Lys Ala Ala He Ala Ala Val Tyr Val Met 

660 665 670 

Pro Ser Ala Pro Asp Ser Thr Phe His Leu Phe Leu Lys Ser Asn Thr 
5 675 680 685 

Asn Arg Arg Leu Gin Lys Val Thr Thr Pro Ser Asp Trp Gin Ala Gly 

690 695 700 

Thr Trp Leu Arg He Asn Leu Asp Lys Pro Phe Pro Val Asn Asn Asp 
705 710 715 720 

10 His Met Leu Phe Ala Gly He Arg Met Pro Asn Lys Tyr Lys Leu Asn 

725 730 735 

Arg Ala He Arg Tyr Val Arg Asn Pro Asp Asn Leu Phe Ser He Thr 

740 745 750 

Gly Lys Lys He Ser Tyr Asn Asn Gly Val Ser Phe Glu Gly Tyr Gly 
15 755 760 765 

He Pro Ser Leu Leu Gly Tyr Met Ala He Lys Tyr Leu Val Val Asn 

770 775 780 

Thr Asp Ala Pro Lys He Asp Met Ser Leu Val Gin Glu Pro Tyr Ala 
785 790 795 800 

20 Lys Gly Thr Asn Val Ala Pro Phe Pro Glu Leu Val Gly He Tyr Val 

805 810 815 

Tyr Lys Asn Gly Thr Phe He Gly Thr Gin Asp Pro Ser Val Thr Thr 

820 825 830 

Tyr Ser Val Ser Asp Gly Thr Glu Ser Asp Glu Tyr Glu lie Lys Leu 
25 835 840 845 

Val Tyr Lys Gly Ser Gly He Ser Asn Gly Val Ala Gin He Glu Asn 

850 855 860 

Asn Asn Ala Val Val Ala Tyr Pro Ser Val Val Thr Asp Arg Phe Ser 
865 870 875 880 

30 He Lys Asn Ala His Met Val His Ala Ala Ala Leu Tyr Ser Leu Asp 

885 890 895 

Gly Lys Gin Val Arg Ser Trp Asn Asn Leu Arg Asn Gly Val Thr Phe 

900 905 910 

Ser Val Gin Gly Leu Thr Ala Gly Thr Tyr Met Leu Val Met Gin Thr 
35 915 920 925 

Ala Asn Gly Pro Val Ser Gin Lys He Val Lys Gin 
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930 935 940 

(2) INFORMATION FOR SEQ ID NO: 126 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1282 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1 . . . 1282 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126 

Met Arg Lys He Leu Ser Phe Leu Met Met Cys Ser Leu His Leu Gly 

! 5 10 • • 15 

Leu Gin Ser Gin ThrTrp His Gly Asp Pro Asp Ser Val Ala Ala Leu 

20 25 30 

Pro Ser He Gly He Gin Glu Ser Ser Cys Thr Arg He Thr Phe Glu 

35 40 45 

Val Val Phe Pro Gly Phe Tyr Ser Val Glu Lys Arg Glu Gly Asn Gin 

50 55 60 

Val Phe Gin Arg He Ser Met Pro Gly Cys Gly Ser Phe Gly Asn Leu 
65 70 75 80 

Gly Glu Ala Glu Leu Pro Val Leu Lys Lys Met He Ala Val Pro Glu 

85 90 95 

Phe Ser Thr Ala Asn Val Ala Val Lys He Lys Glu Thr Glu Thr Phe 
100 105 11° 
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Asp Asn Tyr Asn lie Tyr Pro Asn Pro Thr Tyr Val Val Glu Glu Leu 

115 120 125 

Pro Glu Gly Gly Thr Tyr Leu Val Glu Ala Phe Ala lie Asn Asn Asp 

130 135 140 

Tyr Tyr Ser Gin Asn Val Ser Leu Pro Ser Thr His Tyr Val Tyr Ser 
145 150 155 160 

Gin Asp Gly Tyr Phe Arg Ser Gin Arg Phe He Glu Val Thr Leu Tyr 

165 170 175 

Pro Phe Arg Tyr Asn Pro Val Arg Gin Glu He Leu Phe Ala Lys Lys 

180 185 190 

He Glu Val Thr He Thr Phe Asp Asn Pro Gin Pro Pro Leu Gin Lys 

195 200 205 

Asn Thr Gly He Phe Asn Lys Val Ala Ser Ser Ala Phe He Asn Tyr 

210 215 220 

Glu Ala Asp Gly Lys Ser Ala He Glu Asn Asp Met Val Phe Ser Arg 
225 230 235 240 

Gly Thr Thr Thr Tyr He Ser Gly Asn Val Ala Ser Asn Leu Pro Gin 

245 250 255 

Asn Cys Asp Tyr Leu Val He Tyr Asp Asp Met Phe Asn Val Asn Gin 

260 265 270 

Gin Pro His Asp Glu He Lys Arg Leu Cys Glu His Arg Ala Phe Tyr 

275 280 285 

Asn Gly Phe Asp Val Ala Ala Val Ser He Lys Asp Val Leu Asn Ser 

290 295 300 

Phe Pro Ser Asn Ala Thr Ser Tyr He Asn Glu Thr Lys Leu Lys Asn 
305 310 315 320 

Phe He Arg Ser Val Tyr Asn Gin Ser Asn Ala Lys Arg Thr Leu Asp 

325 330 335 

Gly Lys Leu Gly Tyr Val Leu Leu He Gly Lys Pro Leu Ser Lys Tyr 

340 345 350 

Leu Ala Asp Thr Asp Asn Thr Lys Val Pro Thr Ser Phe He His Asn 

355 360 365 

Val Ser Leu He Pro Ser His Pro Thr Phe Gly Ser lie Cys Ala Ser 

370 375 380 

Asp Tyr Phe Phe Ser Cys Val Ser Pro Leu Asp Thr Val Gly Asp Leu 
385 390 395 400 



Phe lie Gly Arg 

He Glu Lys Thr 
420 

Lys Asn He Leu 
435 

Arg Leu Phe Leu 
450 

Leu Lys Ser Asn 
465 

Asn Asn Gly Ser 

Val Trp Gly He 
500 

Leu Asn Asn Thr 
515 

Ser Ala Val Ala 
530 

Thr Tyr Ala Pro 
545 

Ala Thr Gin Tyr 

Phe Tyr Glu Tyr 
580 

Val Gly Glu Met 
595 

Thr Tyr Ser Lys 
610 

He Met Ala His 
625 

Asn Thr He He 

Lys He Pro Glu 
660 

Gin Val Met Ser 
675 



Phe Ser Val Thr 
405 

He Asn Lys Glu 

Tyr Ala Glu Gly 
440 

Lys Glu He Ala 
455 

Gin Val Ser Ala 
470 

His His Phe Tyr 
485 

Gly Gin Gly Leu 

Ser Ser Gin Gly 
520 

Asp Ser Thr He 
535 

Asn Lys Gly Phe 
550 

Ala Val Tyr Leu 
565 

Leu Pro Tyr Ser 

Leu Leu Ser Ser 
600 

Phe Asn Phe Asn 
615 

Gly Met Glu Val 
630 

Ser Ser Pro He 
645 

Lys Gly Val Leu 

Gly Gly Thr Leu 
680 
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Asn Ala His Glu 
410 

He Ser Tyr Asn 
425 

Lys Gly Cys Asp 

Ser Gly Tyr Thr 
460 

He Asp Ser He 
475 

Phe Asn Thr His 
490 

Asp Val Asn Thr 
505 

Leu Cys Thr Ser 

Arg Ser Leu Gly 
540 

Ser Ala Phe Leu 
555 

Glu Gly Pro Cys 
570 

Leu Tyr His Asn 
585 

He He Asn Thr 

Leu Leu Gly Asp 
620 

Ser Asn Cys He 
635 

Thr He Lys Asn 
650 

His Phe Thr Asn 
665 

Glu He Gly Asn 



Leu His Asn Leu 
415 

Pro He Ala His 
430 

Ala Pro He Leu 
445 

Val Asn Ser He 

Phe Asp Cys Leu 
480 

Gly Met Pro Thr 
495 

Leu Thr Ala Arg 
510 

Leu Ser Cys Ser 
525 

Glu Val Leu Thr 

Gly Gly Ser Arg 
560 

Pro Pro Ser Glu 
575 

Leu Ser Thr Val 

- 5 9*0 
Asn Ser Val Asp 
605 

Pro Ala Leu Asn 

Thr Leu Pro Asn 
640 

Gly Gly Cys Leu 
655 

Asn Gly Ser He 
670 

Gin Ala Lys He 
685 
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Ser Gly Glu Thr 
690 

Gly Leu Ala lie 
705 

Asn Leu Phe Ser 

Val Lys Phe Asn 
740 

lie Ser Asn Cys 
755 

Cys Asp Leu Ser 
770 

Val Phe Lys Pro 
785 

Ala Lys lie Thr 

His lie Thr Asn 
820 

Lys Leu Asp Asn 
835 

Asn Cys Asp Glu 
850 

Arg Leu His Asn 
865 

Ser Thr Leu Tyr 

Asn Asn His lie 
900 

Asp Asn Ala Pro 
915 

Asn Arg Thr Trp 
930 

Phe His Tyr Asn 
945 

Asn Asp Thr Tyr 



Gly Ala Asn Pro 
695 

Asn Lys Gin Val 
710 

Thr His Ser Val 
725 

Ser Ala Pro Leu 

Glu Phe Thr Asn 
760 

Val Glu Asn Ser 
775 

Met Ala Thr Ser 
790 

Asp Asn Thr Phe 
805 

Thr Pro Gly Leu 

lie Pro Glu Tyr 
840 

Ala Leu Val Leu 
855 

lie Thr Arg Asn 
870 

Asn Ser Tyr Gly 
885 

Gly Val Arg Leu 

Val lie Asn Glu 

920 

Gin Leu Tyr Ser 
935 

Ser Leu Gin Gly 
950 

Thr Asn Arg Tyr 
965 



Thr Phe lie Thr 
700 

Glu lie Asp Asn 
715 

Met Pro Lys Phe 
730 

Tyr Thr Thr Asn 
745 

Arg Ser Asp lie 

Met Phe Ser Ser 
780 

Ser lie Thr Gly 
795 

Phe Ala Thr Gly 
810 

Thr Ala Thr Ser 
825 

Tyr lie Ser Gly 

Asn Asn Ser Gly 
860 

Val lie Lys Asn 
875 

lie Tyr Asn Arg 
890 

Leu Asn Asn Ser 
905 

Glu Asp Lys Gin 

Ser Asn Gly Thr 
940 

Gly Asp Thr Asp 
955 

lie Asp Val Ser 
970 



Val Tyr Gly Asp 

lie Asp Arg Leu 
720 

His Phe Asp Ser 
735 

Cys lie Val Glu 
750 

lie Ser Lys Asn 
765 

Ser Gly lie Thr 

Leu Ser Thr Lys 
800 

Asn Phe Ala Tyr 
815 

Asn Ala Ala lie 
830 

Asn Lys lie Val 
845 

Asn Arg Thr Asn 

Cys Arg lie Gly 
880 

Asn Lys lie Ser 
895 

Cys Phe Tyr Phe 
910 

Thr Phe lie Ser 
925 

Phe Pro Leu Asn 

Thr Trp lie Tyr 
960 

Asn Asn His Trp 
975 
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Gly Asn Asn Asp Leu Phe Asp Pro Asn Gin Val Phe Asn Thr Pro Asp 

980 985 990 

Leu Phe He Trp He Pro Phe Trp Asp Gly Leu Pro Asn Gly Arg Ser 
995 1000 1005 

5 Gly Asn Ser Ser Ala Glu Ala Val Glu Phe Gin Thr Ala Leu Asp Cys 
1010 1015 1020 

He Gly Asn Ser Asp Tyr Leu Ser Ala Lys Val Ala Leu Lys Met Met 
1025 1030 1035 1040 

Val Glu Thr Tyr Pro Glu Ser Asp Phe Ala He Ala Ala Leu Lys Glu 
10 1045 1050 1055 

Leu Phe Arg He Glu Lys Met Ser Gly Asn Asp Tyr Glu Gly Leu Lys 

1060 1065 1070 

Asp Tyr Phe Arg Ser Asn Pro Thr He He Ser Ser Gin Asn Leu Phe 
1075 1080 1085 

15 Pro Thr Ala Asp Phe Leu Ser Ala Arg Cys Asp He Val Cys Glu Asn 
1090 1095 1100 

Tyr Gin Ser Ala He Asp Trp Tyr Glu Asn Arg Leu Asn Ser Glu He 
1105 1110 1H5 1120 

Ser Tyr Gin Asp Ser Val Phe Ala Val He Asp Leu Gly Asp He Tyr 
20 1125 1130 1135 

Trp Asn Met Gin Leu Asp Ser Leu Arg Gly Thr Gly He Asp Leu Asn 

1140 1145 115° 

He Leu Ser Cys Glu Gin Arg Lys Ser Leu Glu Ser His Gin Asn Val 
1155 1160 1165- 

25 Lys Asn Tyr Leu Leu Ser Thr Leu Pro Glu Ser Thr Gly Thr Leu Leu 
1170 1175 H80 

Pro Pro Leu Glu Cys Asn Lys Ser Ser Leu Asp Lys Ser Lys He He 
1185 1190 1195 1200 

Ser He Ser Pro Asn Pro Ala Lys Ala Val Val Thr He He Tyr Tyr 
30 1205 1210 1215 

Thr Asp Asn Pro Ser Cys Ser Val He Lys He Tyr Gly He Asn Gly 

1220 1225 1230 

Ala Ser Ala Asp He Thr Gly Leu Pro Lys His Leu Ser Glu Gly Tyr 
1235 1240 1245 

35 Tyr Ser He Gin Phe Asn Thr Ser Asn Phe Asp Pro Gly Phe Tyr Leu 
1250 1255 1260 



4 



5 



25 
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Val Thr Leu Asn Val Asp Gin Lys lie lie Asp Thr Glu Lys Leu Arg 
1265 1270 1275 1280 

lie Lys 



(2) INFORMATION FOR SEQ ID NO: 127 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 5 amino acids 
10 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

15 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

20 (ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1 . . . 925 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127 



Met Ala lie Met Met Lys Ser lie Val Phe Arg Ala Phe Leu Thr lie 
15 10 15 

Leu Leu Ser Trp Ala Ala lie Thr Asn Pro Thr Ala Gin Glu lie Ser 
20 25 30 

30 Gly Met Asn Ala Ser Cys Leu Ala Ala Pro Ala Gin Pro Asp Thr lie 
35 40 45 

Leu Tyr Glu Ser Phe Glu Asn Gly Pro Val Pro Asn Gly Trp Leu Glu 

50 55 60 

lie Asp Ala Asp Ala Asp Gly Ala Thr Trp Gly Ser Pro Ser Gly Ser 
35 65 70 75 80 

Phe Ser Val Pro Tyr Gly His Asn Gly Leu Cys Thr Tyr Ser His lie 
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85 

Arg Ser Gly lie Ser 
100 

Glu Gly Ala Lys Arg 
115 

Asn Pro Glu His Tyr 
130 

Glu Asp Phe Val Leu 
145 

Pro Leu Val Trp Arg 
165 

Tyr lie Ala Trp Arg 
180 

Leu Lys Leu Asp Asp 
195 

Pro Ala Thr Asp Phe 
210 

Leu Thr Trp Asn Tyr 
225 

Glu Glu Leu Gin Leu 
245 

Leu Ala Gin He Lys 
260 

Tyr Ser Leu Arg Asp 
275 

Val Tyr Asp Glu Ser 
290 

Tyr Ala Thr Asp Ala 
305 

Pro Asn Gly Trp Leu 
325 

Gly His Tyr Leu Asn 
340 

His Cys Ser Leu Ser 
355 

Pro Asp Asn Tyr Leu 



90 

Thr Ala Gly Asn Tyr 
105 

Val Lys Tyr Trp Val 
120 

Ala Val Met Val Ser 
135 

Leu Phe Asp Asp Ser 
150 

Arg Arg He Val Asp 
170 

His Tyr Lys Val Thr 
185 

Val Thr Val Tyr Arg 
200 

Thr Val He Asn He 
215 

Pro Glu Asp Tyr Gin 
230 

Ser Gly Tyr Asn He 
250 

Asp Val Ser He Leu 
265 

Asn Pro Leu Gin Val 
280 

He Glu Ser Ser Thr 
295 

He Leu Tyr Glu Asn 
310 

Val He Asp Ala Asp 
330 

Ala Tyr Asp Ala Phe 
345 

Ala Ser Tyr Val Pro 
360 

He Thr Pro Lys Val 



95 

Leu He Thr Pro Asn He 
110 

Cys Asn Gin Tyr Ser Thr 
125 

Thr Thr Gly Thr Ala He 
140 

He Thr Gly Lys Pro Thr 
155 160 
Leu Pro Glu Gly Thr Lys 
175 

Asp Ser His Thr Glu Phe 
190 

Ser He Glu Gly Pro Glu 
205 

Gly Gin Asn Val Gly Arg 
220 

Pro Glu Gly Lys Gly Asn 
235 240 
Tyr Ala Asn Gly Thr Leu 
255 

Glu Tyr Val Asp Ser Thr 
270 

Glu Tyr Cys Vai Thr Ala 
285 

Val Cys Gly Thr Leu His 
300 

Phe Glu Asn Gly Pro Val 
315 320 
Gly Asp Gly Phe Ser Trp 
335 

Pro Gly His Asn Gly Gly 
350 

Gly He Gly Pro Val Thr 
365 

Glu Gly Ala Lys Arg Val 
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370 375 380 

Lys Tyr Trp Val Ser Thr Gin Asp Ala Asn Trp Ala Ala Glu His Tyr 
385 390 395 400 

Ala Val Met Ala Ser Thr Thr Gly Thr Ala Val Gly Asp Phe Val lie 
5 405 410 415 

Leu Phe Glu Glu Thr Met Thr Ala Lys Pro Thr Gly Ala Trp Tyr Glu 

420 425 430 

Arg Thr lie Asn Leu Pro Glu Gly Thr Lys Tyr lie Ala Trp Arg His 
435 440 445 

10 Tyr Asn Cys Thr Asp lie Tyr Phe Leu Lys Leu Asp Asp lie Thr Val 
450 455 460 

Phe Gly Thr Pro Ala Ser Glu Pro Glu Pro Val Thr Asp Phe Val Val 
465 470 475 480 

Ser Leu lie Glu Asn Asn Lys Gly Arg Leu Lys Trp Asn Tyr Pro Asn 
15 485 490 495 

Gly Tyr Glu Pro Asp Lys Thr Asp Asp Lys Asp Pro Leu Gin Leu Ala 

500 505 510 

Gly Tyr Asn lie Tyr Ala Asn Gly Ser Leu Leu Val His lie Gin Asp 
515 520 525 

20 Pro Thr Val Leu Glu Tyr lie Asp Glu Thr Tyr Ser Ser Arg Asp Asp 
530 535 540 

Gin Val Glu Val Glu Tyr Cys Val Thr Ala Val Tyr Asn Asp Asn lie 
545 550 555 560 

Glu Ser Gin Ser Val Cys Asp Lys Leu lie Tyr Asp Ser Gin Ser Asp 
25 565 570 575 

lie lie Leu Tyr Glu Gly Phe Glu Ala Gly Ser lie Pro Glu Gly Trp 

580 585 590 

Leu Leu lie Asp Ala Asp Gly Asp Asn Val Asn Trp Asp Tyr Tyr Pro 
595 600 605 

30 Trp Thr Met Tyr Gly His Asp Ser Glu Lys Cys lie Ala Ser Pro Ser 
610 615 620 

Tyr Leu Pro Met lie Gly Val Leu Thr Pro Asp Asn Tyr Leu Val Thr 
625 630 635 640 

Pro Arg Leu Glu Gly Ala Lys Leu Val Lys Tyr Trp Val Ser Ala Gin 
35 645 650 655 

Asp Ala Val Tyr Ser Ala Glu His Tyr Ala Val Met Val Ser Thr Thr 
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660 

Gly Thr Ala Val 
675 

Ala Lys Ala Asn 
690 

Gly Thr Lys Tyr 
705 

Phe Leu Leu Leu 

Pro Glu Pro Val 
740 

Gly Arg Leu Lys 
755 

Asp Asp Lys Lys 
770 

Gly Ser Leu Leu 
785 

Asp Glu Thr Tyr 

Val Thr Ala Val 
820 

Lys Leu Asn Tyr 
835 

Ser Leu Lys lie 
850 

Gly Leu Ser Arg 
865 

lie Cys lie Leu 

Val Ser Arg Leu 
900 

Asn Lys Thr Thr 
915 



Glu Asp Phe Val 
680 

Gly Ala Trp Tyr 
695 

lie Ala Trp Arg 
710 

Asp Asp lie Thr 
725 

Thr Asp Phe Val 

Trp Asn Tyr Pro 
760 

Pro Leu Gin Leu 
775 

Val His lie Gin 
790 

Ser Ser Arg Asp 
805 

Tyr Asn Asp Asn 

Thr lie Thr Ser 
840 

Tyr Pro Asn Pro 
855 

Ser Lys Ser Thr 
870 

Arg Glu Glu Thr 
885 

Asn Asp Gly Val 

Thr Glu Lys Val 
920 



665 

Leu Leu Phe Glu 

Glu Arg Thr lie 
700 

His Tyr Asp Cys 
715 

Val Tyr Arg Ser 
730 

Val Ser Leu lie 
745 

Asn Gly Tyr Glu 

Thr Gly Tyr Asn 
780 

Asp Pro Thr Val 
795 

Gly Gin Val Glu 
810 

lie Glu Ser Gin 
825 

Leu Asp Asn lie 

Ala Ser Tyr Val 
860 

lie Glu Leu Tyr 
875 

His Ser Glu Lys 
890 

Tyr Leu lie Lys 
905 

Glu lie Lys Arg 



670 

Glu Thr Met Thr 
685 

Thr Leu Pro Ala 

Thr Asp Met Phe 
720 

Thr Glu Thr Val 
735 

Glu Asn Asn Lys 
750 

Pro Asp Lys Thr 
765 

lie Tyr Ala Asn 

Leu Glu Tyr lie 
800 

Met Glu Tyr Cys 
815 

Ser Val Cys Asp 
830 

Gin Ser Asp Thr 
845 

Val Arg lie Glu 

Asn Ala Leu Gly 
880 

Thr Glu lie Asp 
895 

Val Val Gly Gly 
910 

Pro 
925 



(2) INFORMATION FOR SEQ ID NO: 128 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 922 amino acids 

(B) TYPE: amino acid 
( D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: protein 



( iii ) HYPOTHETICAL : YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1 . . . 922 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128 



Met Met Lys Ser lie Val Phe Arg Ala Phe Leu Thr lie Leu Leu Ser 

15 10 15 

Trp Ala Ala lie Thr Asn Pro Thr Ala Gin Glu lie Ser Gly Met Asn 

20 25 30 

Ala Ser Cys Leu Ala Ala Pro Ala Gin Pro Asp Thr lie Leu Tyr Glu 

35 40 45 . 

Ser Phe Glu Asn Gly Pro Val Pro Asn Gly Trp Leu Glu lie Asp Ala 

50 55 60 

Asp Ala Asp Gly Ala Thr Trp Gly Ser Pro Ser Gly Ser Phe Ser Val 
65 70 75 80 

Pro Tyr Gly His Asn Gly Leu Cys Thr Tyr Ser His lie Arg Ser Gly 

85 90 95 

lie Ser Thr Ala Gly Asn Tyr Leu He Thr Pro Asn He Glu Gly Ala 

100 105 HO 

Lys Arg Val Lys Tyr Trp Val Cys Asn Gin Tyr Ser Thr Asn Pro Glu 

115 120 125 

His Tyr Ala Val Met Val Ser Thr Thr Gly Thr Ala He Glu Asp Phe 
130 135 140 
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Val Leu Leu Phe Asp Asp Ser lie Thr Gly Lys Pro Thr Pro Leu Val 
145 150 155 160 

Trp Arg Arg Arg lie Val Asp Leu Pro Glu Gly Thr Lys Tyr lie Ala 
165 170 175 

5 Trp Arg His Tyr Lys Val Thr Asp Ser His Thr Glu Phe Leu Lys Leu 
180 185 190 

Asp Asp Val Thr Val Tyr Arg Ser lie Glu Gly Pro Glu Pro Ala Thr 

195 200 205 

Asp Phe Thr Val lie Asn lie Gly Gin Asn Val Gly Arg Leu Thr Trp 
10 210 215 220 

Asn Tyr Pro Glu Asp Tyr Gin Pro Glu Gly Lys Gly Asn Glu Glu Leu 
225 230 235 240 

Gin Leu Ser Gly Tyr Asn lie Tyr Ala Asn Gly Thr Leu Leu Ala Gin 
245 250 255 

15 lie Lys Asp Val Ser lie Leu Glu Tyr Val Asp Ser Thr Tyr Ser Leu 
260 265 270 

Arg Asp Asn Pro Leu Gin Val Glu Tyr Cys Val Thr Ala Val Tyr Asp 

275 280 285 

Glu Ser lie Glu Ser Ser Thr Val Cys Gly Thr Leu His Tyr Ala Thr 
20 290 295 300 

Asp Ala lie Leu Tyr Glu Asn Phe Glu Asn Gly Pro Val Pro Asn Gly 
305 310 315 320 

Trp Leu Val lie Asp Ala Asp Gly Asp Gly Phe Ser Trp Gly His Tyr 
325 330 - 335 

25 Leu Asn Ala Tyr Asp Ala Phe Pro Gly His Asn Gly Gly His Cys Ser 
340 345 350 

Leu Ser Ala Ser Tyr Val Pro Gly lie Gly Pro Val Thr Pro Asp Asn 

355 360 365 

Tyr Leu lie Thr Pro Lys Val Glu Gly Ala Lys Arg Val Lys Tyr Trp 
30 370 375 380 

Val Ser Thr Gin Asp Ala Asn Trp Ala Ala Glu His Tyr Ala Val Met 
385 390 395 400 

Ala Ser Thr Thr Gly Thr Ala Val Gly Asp Phe Val lie Leu Phe Glu 
405 410 415 

35 Glu Thr Met Thr Ala Lys Pro Thr Gly Ala Trp Tyr Glu Arg Thr lie 
420 425 430 
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Asn Leu Pro Glu Gly Thr Lys Tyr lie Ala Trp Arg His Tyr Asn Cys 

435 440 445 

Thr Asp lie Tyr Phe Leu Lys Leu Asp Asp lie Thr Val Phe Gly Thr 
450 455 460 

5 Pro Ala Ser Glu Pro Glu Pro Val Thr Asp Phe Val Val Ser Leu lie 
465 470 475 480 

Glu Asn Asn Lys Gly Arg Leu Lys Trp Asn Tyr Pro Asn Gly Tyr Glu 

485 490 495 

Pro Asp Lys Thr Asp Asp Lys Asp Pro Leu Gin Leu Ala Gly Tyr Asn 
10 500 505 510 

lie Tyr Ala Asn Gly Ser Leu Leu Val His lie Gin Asp Pro Thr Val 

515 520 525 

Leu Glu Tyr lie Asp Glu Thr Tyr Ser Ser Arg Asp Asp Gin Val Glu 
530 535 540 

15 Val Glu Tyr Cys Val Thr Ala Val Tyr Asn Asp Asn lie Glu Ser Gin 
545 550 555 560 

Ser Val Cys Asp Lys Leu lie Tyr Asp Ser Gin Ser Asp lie lie Leu 

565 570 575 

Tyr Glu Gly Phe Glu Ala Gly Ser lie Pro Glu Gly Trp Leu Leu lie 
20 580 585 590 

Asp Ala Asp Gly Asp Asn Val Asn Trp Asp Tyr Tyr Pro Trp Thr Met 

595 600 605 

Tyr Gly His Asp Ser Glu Lys Cys lie Ala Ser Pro Ser Tyr Leu Pro 
610 615 620 

25 Met lie Gly Val Leu Thr Pro Asp Asn Tyr Leu Val Thr Pro Arg Leu 
625 630 635 640 

Glu Gly Ala Lys Leu Val Lys Tyr Trp Val Ser Ala Gin Asp Ala Val 

645 650 655 

Tyr Ser Ala Glu His Tyr Ala Val Met Val Ser Thr Thr Gly Thr Ala 
30 660 665 670 

Val Glu Asp Phe Val Leu Leu Phe Glu Glu Thr Met Thr Ala Lys Ala 

675 680 685 

Asn Gly Ala Trp Tyr Glu Arg Thr lie Thr Leu Pro Ala Gly Thr Lys 
690 695 700 

35 Tyr lie Ala Trp Arg His Tyr Asp Cys Thr Asp Met Phe Phe Leu Leu 
705 710 715 720 
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Leu Asp Asp He Thr Val Tyr Arg Ser Thr Glu Thr Val Pro Glu Pro 

725 730 735 

Val Thr Asp Phe Val Val Ser Leu He Glu Asn Asn Lys Gly Arg Leu 

740 745 750 

Lys Trp Asn Tyr Pro Asn Gly Tyr Glu Pro Asp Lys Thr Asp Asp Lys 

755 760 765 

Lys Pro Leu Gin Leu Thr Gly Tyr Asn He Tyr Ala Asn Gly Ser Leu 

770 775 780 

Leu Val His He Gin Asp Pro Thr Val Leu Glu Tyr He Asp Glu Thr 
785 790 795 800 

Tyr Ser Ser Arg Asp Gly Gin Val Glu Met Glu Tyr Cys Val Thr Ala 

805 810 815 

Val Tyr Asn Asp Asn He Glu Ser Gin Ser Val Cys Asp Lys Leu Asn 

820 825 830 

Tyr Thr He Thr Ser Leu Asp Asn He Gin Ser Asp Thr Ser Leu Lys 

835 840 845 

He Tyr Pro Asn Pro Ala Ser Tyr Val Val Arg He Glu Gly Leu Ser 

850 855 860 

Arg Ser Lys Ser Thr He Glu Leu Tyr Asn Ala Leu Gly He Cys He 
865 870 875 880 

Leu Arg Glu Glu Thr His Ser Glu Lys Thr Glu He Asp Val Ser Arg 

885 890 895 

Leu Asn Asp Gly Val Tyr Leu He Lys Val Val Gly Gly Asn Lys Thr 

900 905 - 910 

Thr Thr Glu Lys Val Glu He Lys Arg Pro 
915 920 



(2) INFORMATION FOR SEQ ID NO: 129 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 921 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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5 



(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: miscjeature 

(B) LOCATION 1. . . 921 



10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129 

Met Lys Ser lie Val Phe Arg Ala Phe Leu Thr lie Leu Leu Ser Trp 
15 10 15 

Ala Ala lie Thr Asn Pro Thr Ala Gin Glu lie Ser Gly Met Asn Ala 
15 20 25 30 

Ser Cys Leu Ala Ala Pro Ala Gin Pro Asp Thr lie Leu Tyr Glu Ser 

35 40 45 

Phe Glu Asn Gly Pro Val Pro Asn Gly Trp Leu Glu lie Asp Ala Asp 
50 55 60 

20 Ala Asp Gly Ala Thr Trp Gly Ser Pro Ser Gly Ser Phe Ser Val Pro 
65 70 75 80 

Tyr Gly His Asn Gly Leu Cys Thr Tyr Ser His lie Arg Ser Gly lie 

85 90 95 

Ser Thr Ala Gly Asn Tyr Leu lie Thr Pro Asn lie Glu Gly Ala Lys 
25 100 105 110 

Arg Val Lys Tyr Trp Val Cys Asn Gin Tyr Ser Thr Asn Pro Glu His 

115 120 125 

Tyr Ala Val Met Val Ser Thr Thr Gly Thr Ala lie Glu Asp Phe Val 
130 135 140 

30 Leu Leu Phe Asp Asp Ser lie Thr Gly Lys Pro Thr Pro Leu Val Trp 
145 150 155 160 

Arg Arg Arg lie Val Asp Leu Pro Glu Gly Thr Lys Tyr lie Ala Trp 

165 170 175 

Arg His Tyr Lys Val Thr Asp Ser His Thr Glu Phe Leu Lys Leu Asp 
35 180 185 190 

Asp Val Thr Val Tyr Arg Ser lie Glu Gly Pro Glu Pro Ala Thr Asp 
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195 200 205 

Phe Thr Val He Asn He Gly Gin Asn Val Gly Arg Leu Thr Trp Asn 



215 220 



210 

/r Pro Glu Asp Tyr Gin Pro Glu Gly Lys Gly Asn Glu Glu Leu Gin 



230 235 240 



Tyi 

225 

Leu Ser Gly Tyr Asn He Tyr Ala Asn Gly Thr Leu Leu Ala Gin He 

245 250 255 

Lys Asp Val Ser He Leu Glu Tyr Val Asp Ser Thr Tyr Ser Leu Arg 

260 265 270 

Asp Asn Pro Leu Gin Val Glu Tyr Cys Val Thr Ala Val Tyr Asp Glu 

275 280 285 

Ser lie Glu Ser Ser Thr Val Cys Gly Thr Leu His Tyr Ala Thr Asp 

290 295 300 

Ala He Leu Tyr Glu Asn Phe Glu Asn Gly Pro Val Pro Asn Gly Trp 

oin ^15 320 

305 310 ^J- 3 

Leu Val He Asp Ala Asp Gly Asp Gly Phe Ser Trp Gly His Tyr Leu 

325 330 335 

Asn Ala Tyr Asp Ala Phe Pro Gly His Asn Gly Gly His Cys Ser Leu 

340 345 350 

Ser Ala Ser Tyr Val Pro Gly He Gly Pro Val Thr Pro Asp Asn Tyr 

355 360 365 

Leu He Thr Pro Lys Val Glu Gly Ala Lys Arg Val Lys Tyr Trp Val 

370 375 380 

Ser Thr Gin Asp Ala Asn Trp Ala Ala Glu His Tyr Ala Va-1 Met Ala 

, on 395 400 

385 390 

Ser Thr Thr Gly Thr Ala Val Gly Asp Phe Val He Leu Phe Glu Glu 

405 410 415 

Thr Met Thr Ala Lys Pro Thr Gly Ala Trp Tyr Glu Arg Thr He Asn 

420 425 430 

Leu Pro Glu Gly Thr Lys Tyr He Ala Trp Arg His Tyr Asn Cys Thr 

435 440 445 

Asp lie Tyr Phe Leu Lys Leu Asp Asp He Thr Val Phe Gly Thr Pro 

450 455 460 

Ala Ser Glu Pro Glu Pro Val Thr Asp Phe Val Val Ser Leu He Glu 
465 470 475 480 

Asn Asn Lys Gly Arg Leu Lys Trp Asn Tyr Pro Asn Gly Tyr Glu Pro 
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485 490 495 

Asp Lys Thr Asp Asp Lys Asp Pro Leu Gin Leu Ala Gly Tyr Asn lie 

500 505 510 

Tyr Ala Asn Gly Ser Leu Leu Val His lie Gin Asp Pro Thr Val Leu 

515 520 525 

Glu Tyr lie Asp Glu Thr Tyr Ser Ser Arg Asp Asp Gin Val Glu Val 

530 535 540 

Glu Tyr Cys Val Thr Ala Val Tyr Asn Asp Asn lie Glu Ser Gin Ser 
545 550 555 560 

Val Cys Asp Lys Leu lie Tyr Asp Ser Gin Ser Asp lie He Leu Tyr 

565 570 575 

Glu Gly Phe Glu Ala Gly Ser He Pro Glu Gly Trp Leu Leu He Asp 

580 585 590 

Ala Asp Gly Asp Asn Val Asn Trp Asp Tyr Tyr Pro Trp Thr Met Tyr 

595 600 605 

Gly His Asp Ser Glu Lys Cys He Ala Ser Pro Ser Tyr Leu Pro Met 

610 615 620 

He Gly Val Leu Thr Pro Asp Asn Tyr Leu Val Thr Pro Arg Leu Glu 
625 630 635 640 

Gly Ala Lys Leu Val Lys Tyr Trp Val Ser Ala Gin Asp Ala Val Tyr 

645 650 655 

Ser Ala Glu His Tyr Ala Val Met Val Ser Thr Thr Gly Thr Ala Val 

660 665 670 

Glu Asp Phe Val Leu Leu Phe Glu Glu Thr Met Thr Ala Ly« Ala Asn 

675 680 685 

Gly Ala Trp Tyr Glu Arg Thr He Thr Leu Pro Ala Gly Thr Lys Tyr 

690 695 700 

He Ala Trp Arg His Tyr Asp Cys Thr Asp Met Phe Phe Leu Leu Leu 
705 710 715 720 

Asp Asp He Thr Val Tyr Arg Ser Thr Glu Thr Val Pro Glu Pro Val 

725 730 735 

Thr Asp Phe Val Val Ser Leu He Glu Asn Asn Lys Gly Arg Leu Lys 

740 745 750 

Trp Asn Tyr Pro Asn Gly Tyr Glu Pro Asp Lys Thr Asp Asp Lys Lys 

755 760 765 

Pro Leu Gin Leu Thr Gly Tyr Asn He Tyr Ala Asn Gly Ser Leu Leu 
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770 

Val His lie Gin Asp 
785 

Ser Ser Arg Asp Gly 
805 

Tyr Asn Asp Asn lie 
820 

Thr lie Thr Ser Leu 
835 

Tyr Pro Asn Pro Ala 
850 

Ser Lys Ser Thr lie 
865 

Arg Glu Glu Thr His 
885 

Asn Asp Gly Val Tyr 
900 

Thr Glu Lys Val Glu 
915 



775 

Pro Thr Val Leu Glu Tyr 
790 795 
Gin Val Glu Met Glu Tyr 
810 

Glu Ser Gin Ser Val Cys 
825 

Asp Asn lie Gin Ser Asp 
840 

Ser Tyr Val Val Arg lie 
855 

Glu Leu Tyr Asn Ala Leu 
870 875 
Ser Glu Lys Thr Glu lie 
890 

Leu lie Lys Val Val Gly 
905 

lie Lys Arg Pro 
920 



780 

lie Asp Glu Thr Tyr 
800 

Cys Val Thr Ala Val 
815 

Asp Lys Leu Asn Tyr 
830 

Thr Ser Leu Lys lie 
845 

Glu Gly Leu Ser Arg 
860 

Gly lie Cys lie Leu 
880 

Asp Val Ser Arg Leu 
895 

Gly Asn Lys Thr Thr 
910 



(2} INFORMATION FOR SEQ ID NO: 130 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1226 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
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(B) LOCATION 1...122 6 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130 

5 Met Met Lys Arg Tyr Thr lie lie Leu Ala Val Phe Leu Leu Phe Cys 
15 10 15 

Thr Val Phe Thr Phe Gin lie Lys Ala Arg Pro Tyr Glu Arg Phe Ala 

20 25 30 

Asp Val Glu Lys Pro Trp lie Gin Lys His Ser Met Asp Ser Lys Leu 
10 35 40 45 

Val Pro Ala Asn Lys Gly Asn Leu lie Gin Ala Glu lie Val Tyr Gin 

50 55 60 

Ser Val Ser Glu His Ser Asp Leu Val lie Ser Pro Val Asn Glu lie 
65 70 75 80 

15 Arg Pro Ala Asn Arg Phe Pro Ser His Arg Lys Ser Phe Phe Ala Glu 

85 90 95 

Asn Leu Arg Ala Ser Pro Pro Val Val Pro Val Ala Val Asp Lys Tyr 

100 105 110 

Ala Val Pro Val Ala Asn Pro Met Asp Pro Glu Asn Pro Asn Ala Trp 
20 115 120 125 

Asp Val Thr Leu Lys lie Thr Thr Lys Ala Val Thr Val Pro Val Asp 

130 135 140 

Val Val Met Val lie Asp Gin Ser Ser Ser Met Gly Gly Gin Asn lie 
145 150 155 - * 160 

25 Ala Arg Leu Lys Ser Ala lie Ala Ser Gly Gin Arg Phe Val Lys Lys 

165 170 175 

Met Leu Pro Lys Gly Thr Ala Thr Glu Gly Val Arg lie Ala Leu Val 

180 185 190 

Ser Tyr Asp His Glu Pro His Arg Leu Ser Asp Phe Thr Lys Asp Thr 
30 195 200 205 

Ala Phe Leu Cys Gin Lys lie Arg Ala Leu Thr Pro lie Trp Gly Thr 

210 215 220 

His Thr Gin Gly Gly Leu Lys Met Ala Arg Asn lie Met Ala Thr Ser 
225 230 235 240 

35 Thr Ala Val Asp Lys His lie lie Leu Met Ser Asp Gly Leu Ala Thr 

245 250 255 
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Glu Gin Tyr Pro Val Lys Asn Val Thr Thr Ala Asp Phe lie Gly Lys 

260 265 270 

Thr Gly Asn Ala Asn Asp Pro lie Asp Leu Val lie Gin Gly Ala lie 
275 280 285 

5 Asn Phe Pro Thr Asn Tyr Val Ser Asn Asn Pro Ser Thr Pro Leu Thr 
290 295 300 

Pro Asn Tyr Pro Thr His Ser Ser Lys Val Gly Arg Arg Asn Leu Pro 
305 310 315 320 

Glu Ser Lys Phe Asp Tyr Ser Asn Leu Ser Ala Arg lie Thr Phe Asp 
10 325 330 335 

Gly Val Ala Gly Ala Leu Val Tyr Glu Pro Arg Phe Pro His Pro Tyr 

340 345 350 

Tyr Tyr Tyr Phe Pro Cys Asn Ala Ala lie Asn Glu Ala Gin Phe Ala 
355 360 365 

15 Lys Asn Ser Gly Tyr Thr lie His Thr lie Gly Tyr Asp Leu Gly Asp 
370 375 380 

Phe Ala Leu Ala Asn Asn Ser Leu Lys Leu Thr Ala Thr Asp Glu Asn 
385 390 395 400 

His Phe Phe Thr Ala Thr Pro Ala Asn Leu Ala Ala Ala Phe Asp Asn 
20 405 410 415 

lie Ala Gin Thr He Asn He Gly He Gin Arg Gly Glu Val Thr Asp 

420 425 430 

Phe Val Ala Pro Gly Phe He Val Lys Asn Leu Thr Gin Ser Gly Asp 
435 440 445 

25 Val Thr His Leu Leu Asn Val Ser Asn Gly Thr Val His Tyr Asp Val 
450 455 460 

Ser Thr Lys Lys Leu Thr Trp Thr Thr Gly Thr He Leu Ser Ser Ser 
465 470 475 480 

Glu Ala Thr He Thr Tyr Arg He Tyr Ala Asp Leu Asp Tyr He Gin 
30 485 490 495 

Asn Asn Asp lie Pro Val Asn Thr Thr Ser Ala He Gly Pro Asp Leu 

500 505 510 

Gly Gly Phe Asp Thr Asn Thr Glu Ala Lys Leu Thr Tyr Thr Asn Ser 
515 520 525 

35 Asn Gly Glu Pro Asn Gin Gin Leu He Phe Pro Arg Pro Thr Val Lys 
530 535 540 
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Leu Gly Tyr Gly Val lie Lys Arg His Tyr Val Leu Val Asn Lys Asp 
545 550 555 560 

Gly Gin Pro lie Gin Ala Asn Gly Thr Val Val Ser Ser Leu Ser Glu 

565 570 575 

Ala His Val Leu Gin Ser Gin Asp Phe Phe Leu Pro Ser Gly Gly Gly 

580 585 590 

His lie Val Pro Lys Trp He Lys Leu Asp Lys Thr Thr Glu Ala Leu 

595 600 605 

Gin Tyr Tyr Ser Val Pro Pro Thr Asn Thr Val He Thr Thr Ala Asp 

610 615 620 

Gly Lys Arg Tyr Arg Phe Val Glu Val Pro Gly Ser Thr Pro Asn Pro 
625 630 635 640 

Gly Gin He Gly He Ser Trp Lys Lys Pro Ala Gly Asn Ala Tyr Phe 

645 650 655 

Ala Tyr Lys Leu Leu Asn Tyr Trp Met Gly Gly Thr Thr Asp Gin Gin 

660 665 670 

Ser Glu Trp Asp Val Thr Ser Asn Trp Thr Gly Ala Gin Val Pro Leu 

675 680 685 

Thr Gly Glu Asp Val Glu Phe Ala Thr Thr Glu Asn Phe Gly Ser Pro 

690 695 700 

Ala Val Ala Asp Leu His Val Pro Thr Thr Asn Pro Lys He He Gly 
705 710 715 720 

Asn Leu He Asn Asn Ser Asp Lys Asp Leu Val Val Thr Thr Ser Ser 

725 730 » 735 

Gin Leu Thr He Asn Gly Val Val Glu Asp Asn Asn Pro Asn Val Gly 

740 745 750 

Thr He Val Val Lys Ser Ser Lys Asp Asn Pro Thr Gly Thr Leu Leu 

755 760 765 

Phe Ala Asn Pro Gly Tyr Asn Gin Asn Val Gly Gly Thr Val Glu Phe 

770 775 780 

Tyr Asn Gin Gly Tyr Asp Cys Ala Asp Cys Gly Met Tyr Arg Arg Ser 
785 790 795 800 

Trp Gin Tyr Phe Gly He Pro Val Asn Glu Ser Gly Phe Pro He Asn 

805 810 815 

Asp Val Gly Gly Asn Glu Thr Val Asn Gin Trp Val Glu Pro Phe Asn 
820 825 830 
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Gly Asp Lys Trp Arg Pro Ala Pro Tyr Ala Pro Asp Thr Glu Leu Gin 

835 840 845 

Lys Phe Lys Gly Tyr Gin He Thr Asn Asp Val Gin Ala Gin Pro Thr 
850 855 860 

5 Gly Val Tyr Ser Phe Lys Gly Met He Cys Val Cys Asp Ala Phe Leu 
865 870 875 880 

Asn Leu Thr Arg Thr Ser Gly Val Asn Tyr Ser Gly Ala Asn Leu He 

885 890 895 

Gly Asn Ser Tyr Thr Gly Ala He Asp He Lys Gin Gly He Val Phe 
10 900 905 910 

Pro Pro Glu Val Glu Gin Thr Val Tyr Leu Phe Asn Thr Gly Thr Arg 

915 920 925 

Asp Gin Trp Arg Lys Leu Asn Gly Ser Thr Val Ser Gly Tyr Arg Ala 
930 935 940 

15 Gly Gin Tyr Leu Ser Val Pro Lys Asn Thr Ala Gly Gin Asp Asn Leu 
945 950 955 960 

Pro Asp Arg He Pro Ser Met His Ser Phe Leu Val Lys Met Gin Asn 

965 970 975 

Gly Ala Ser Cys Thr Leu Unk He Leu Tyr Asp Lys Leu Leu Lys Asn 
20 980 985 990 

Thr Thr Val Asn Asn Gly Asn Gly Thr Gin He Thr Trp Arg Ser Gly 

995 1000 1005 

Asn Ser Gly Ser Ala Asn Met Pro Ser Leu Val Met Asp Val Leu Gly 
1010 1015 1020- 

25 Asn Glu Ser Ala Asp Arg Leu Trp He Phe Thr Asp Gly Gly Leu Ser 

1025 1030 1035 1040 

Phe Gly Phe Asp Asn Gly Trp Asp Gly Arg Lys Leu Thr Glu Lys Gly 

1045 1050 1055 

Leu Ser Gin Leu Tyr Ala Met Ser Asp He Gly Asn Asp Lys Phe Gin 
30 1060 1065 1070 

Val Ala Gly Val Pro Glu Leu Asn Asn Leu Leu He Gly Phe Asp Ala 

1075 1080 1085 

Asp Lys Asp Gly Gin Tyr Thr Leu Glu Phe Ala Leu Ser Asp His Phe 
1090 1095 1100 

35 Ala Lys Gly Ala Val Tyr Leu His Asp Leu Gin Ser Gly Ala Lys His 

1105 1110 1115 1120 
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Arg lie Thr Asn Ser Thr Ser Tyr Ser Phe Asp Ala Lys Arg Gly Asp 

1125 1130 1135 

Ser Gly Ala Arg Phe Arg Leu Ser Tyr Gly Cys Asp Glu Asn Val Asp 

1140 1145 1150 

Asp Ser His Val Val Ser Thr Asn Gly Arg Glu lie lie lie Leu Asn 

1155 1160 1165 

Gin Asp Ala Leu Asp Cys Thr Val Thr Leu Phe Thr lie Glu Gly Lys 

1170 1175 1180 

Leu Leu Arg Arg Leu Lys Val Leu Ala Gly His Arg Glu Val Met Lys 
1185 1190 1195 12 

Val Gin Thr Gly Gly Ala Tyr lie Val His Leu Gin Asn Ala Phe Thr 

1205 1210 1215 

Asn Asp Val His Lys Val Leu Val Glu Tyr 
1220 1225 

(2) INFORMATION FOR SEQ ID NO: 131 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1225 amino acids 

(B) TYPE: amino acid 
( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

( iii ) HYPOTHETICAL : YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1...1225 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131 
Met Lys Arg Tyr Thr lie lie Leu Ala Val Phe Leu Leu Phe Cys Thr 
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15 10 15 

Val Phe Thr Phe Gin lie Lys Ala Arg Pro Tyr Glu Arg Phe Ala Asp 

20 25 30 

Val Glu Lys Pro Trp lie Gin Lys His Ser Met Asp Ser Lys Leu Val 
5 35 40 45 

Pro Ala Asn Lys Gly Asn Leu lie Gin Ala Glu lie Val Tyr Gin Ser 

50 55 60 

Val Ser Glu His Ser Asp Leu Val lie Ser Pro Val Asn Glu lie Arg 
65 70 75 80 

10 Pro Ala Asn Arg Phe Pro Ser His Arg Lys Ser Phe Phe Ala Glu Asn 

85 90 95 

Leu Arg Ala Ser Pro Pro Val Val Pro Val Ala Val Asp Lys Tyr Ala 

100 105 110 

Val Pro Val Ala Asn Pro Met Asp Pro Glu Asn Pro Asn Ala Trp Asp 
15 115 120 125 

Val Thr Leu Lys lie Thr Thr Lys Ala Val Thr Val Pro Val Asp Val 

130 135 140 

Val Met Val He Asp Gin Ser Ser Ser Met Gly Gly Gin Asn He Ala 
145 150 155 160 

20 Arg Leu Lys Ser Ala He Ala Ser Gly Gin Arg Phe Val Lys Lys Met 

165 170 175 

Leu Pro Lys Gly Thr Ala Thr Glu Gly Val Arg He Ala Leu Val Ser 

180 185 190 

Tyr Asp His Glu Pro His Arg Leu Ser Asp Phe Thr Lys Asp Thr Ala 
25 195 200 205 

Phe Leu Cys Gin Lys He Arg Ala Leu Thr Pro He Trp Gly Thr His 

210 215 220 

Thr Gin Gly Gly Leu Lys Met Ala Arg Asn He Met Ala Thr Ser Thr 
225 230 235 240 

30 Ala Val Asp Lys His He He Leu Met Ser Asp Gly Leu Ala Thr Glu 

245 250 255 

Gin Tyr Pro Val Lys Asn Val Thr Thr Ala Asp Phe He Gly Lys Thr 

260 265 270 

Gly Asn Ala Asn Asp Pro He Asp Leu Val He Gin Gly Ala He Asn 
35 275 280 285 

Phe Pro Thr Asn Tyr Val Ser Asn Asn Pro Ser Thr Pro Leu Thr Pro 
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290 295 300 

Asn Tyr Pro Thr His Ser Ser Lys Val Gly Arg Arg Asn Leu Pro Glu 
305 310 315 320 

Ser Lys Phe Asp Tyr Ser Asn Leu Ser Ala Arg lie Thr Phe Asp Gly 

325 330 335 

Val Ala Gly Ala Leu Val Tyr Glu Pro Arg Phe Pro His Pro Tyr Tyr 

340 345 350 

Tyr Tyr Phe Pro Cys Asn Ala Ala lie Asn Glu Ala Gin Phe Ala Lys 

355 360 365 

Asn Ser Gly Tyr Thr lie His Thr lie Gly Tyr Asp Leu Gly Asp Phe 

370 375 380 

Ala Leu Ala Asn Asn Ser Leu Lys Leu Thr Ala Thr Asp Glu Asn His 
385 390 395 400 

Phe Phe Thr Ala Thr Pro Ala Asn Leu Ala Ala Ala Phe Asp Asn lie 

405 410 415 

Ala Gin Thr lie Asn lie Gly lie Gin Arg Gly Glu Val Thr Asp Phe 

420 425 430 

Val Ala Pro Gly Phe lie Val Lys Asn Leu Thr Gin Ser Gly Asp Val 

435 440 445 

Thr His Leu Leu Asn Val Ser Asn Gly Thr Val His Tyr Asp Val Ser 

450 455 460 

Thr Lys Lys Leu Thr Trp Thr Thr Gly Thr lie Leu Ser Ser Ser Glu 
465 470 475 480 

Ala Thr lie Thr Tyr Arg lie Tyr Ala Asp Leu Asp Tyr Il-e Gin Asn 

485 490 495 

Asn Asp lie Pro Val Asn Thr Thr Ser Ala lie Gly Pro Asp Leu Gly 

500 505 510 

Gly Phe Asp Thr Asn Thr Glu Ala Lys Leu Thr Tyr Thr Asn Ser Asn 

515 520 525 

Gly Glu Pro Asn Gin Gin Leu lie Phe Pro Arg Pro Thr Val Lys Leu 

530 535 540 

Gly Tyr Gly Val lie Lys Arg His Tyr Val Leu Val Asn Lys Asp Gly 
545 550 555 560 

Gin Pro lie Gin Ala Asn Gly Thr Val Val Ser Ser Leu Ser Glu Ala 

565 570 575 

His Val Leu Gin Ser Gin Asp Phe Phe Leu Pro Ser Gly Gly Gly His 
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580 585 590 

lie Val Pro Lys Trp lie Lys Leu Asp Lys Thr Thr Glu Ala Leu Gin 

595 600 605 

Tyr Tyr Ser Val Pro Pro Thr Asn Thr Val lie Thr Thr Ala Asp Gly 
5 610 615 620 

Lys Arg Tyr Arg Phe Val Glu Val Pro Gly Ser Thr Pro Asn Pro Gly 
625 630 635 640 

Gin lie Gly lie Ser Trp Lys Lys Pro Ala Gly Asn Ala Tyr Phe Ala 
645 650 655 

10 Tyr Lys Leu Leu Asn Tyr Trp Met Gly Gly Thr Thr Asp Gin Gin Ser 
660 665 670 

Glu Trp Asp Val Thr Ser Asn Trp Thr Gly Ala Gin Val Pro Leu Thr 

675 680 685 

Gly Glu Asp Val Glu Phe Ala Thr Thr Glu Asn Phe Gly Ser Pro Ala 
15 690 695 700 

Val Ala Asp Leu His Val Pro Thr Thr Asn Pro Lys lie lie Gly Asn 
705 710 715 720 

Leu lie Asn Asn Ser Asp Lys Asp Leu Val Val Thr Thr Ser Ser Gin 
725 730 735 

20 Leu Thr lie Asn Gly Val Val Glu Asp Asn Asn Pro Asn Val Gly Thr 
740 745 750 

He Val Val Lys Ser Ser Lys Asp Asn Pro Thr Gly Thr Leu Leu Phe 

755 760 765 

Ala Asn Pro Gly Tyr Asn Gin Asn Val Gly Gly Thr Val Glu Phe Tyr 
25 770 775 780 

Asn Gin Gly Tyr Asp Cys Ala Asp Cys Gly Met Tyr Arg Arg Ser Trp 
785 790 795 800 

Gin Tyr Phe Gly He Pro Val Asn Glu Ser Gly Phe Pro He Asn Asp 
805 810 815 

30 Val Gly Gly Asn Glu Thr Val Asn Gin Trp Val Glu Pro Phe Asn Gly 
820 825 830 

Asp Lys Trp Arg Pro Ala Pro Tyr Ala Pro Asp Thr Glu Leu Gin Lys 

835 840 845 

Phe Lys Gly Tyr Gin He Thr Asn Asp Val Gin Ala Gin Pro Thr Gly 
35 850 855 860 

Val Tyr Ser Phe Lys Gly Met He Cys Val Cys Asp Ala Phe Leu Asn 
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865 870 875 880 

Leu Thr Arg Thr Ser Gly Val Asn Tyr Ser Gly Ala Asn Leu lie Gly 

885 890 895 

Asn Ser Tyr Thr Gly Ala lie Asp lie Lys Gin Gly lie Val Phe Pro 

900 905 910 

Pro Glu Val Glu Gin Thr Val Tyr Leu Phe Asn Thr Gly Thr Arg Asp 

915 920 925 

Gin Trp Arg Lys Leu Asn Gly Ser Thr Val Ser Gly Tyr Arg Ala Gly 

930 935 940 

Gin Tyr Leu Ser Val Pro Lys Asn Thr Ala Gly Gin Asp Asn Leu Pro 
945 950 955 960 

Asp Arg lie Pro Ser Met His Ser Phe Leu Val Lys Met Gin Asn Gly 

965 970 975 

Ala Ser Cys Thr Leu Unk lie Leu Tyr Asp Lys Leu Leu Lys Asn Thr 

980 985 990 

Thr Val Asn Asn Gly Asn Gly Thr Gin lie Thr Trp Arg Ser Gly Asn 

995 1000 1005 

Ser Gly Ser Ala Asn Met Pro Ser Leu Val Met Asp Val Leu Gly Asn 

1010 1015 1020 

Glu Ser Ala Asp Arg Leu Trp lie Phe Thr Asp Gly Gly Leu Ser Phe 
1025 1030 1035 1040 

Gly Phe Asp Asn Gly Trp Asp Gly Arg Lys Leu Thr Glu Lys Gly Leu 

1045 1050 1055 

Ser Gin Leu Tyr Ala Met Ser Asp lie Gly Asn Asp Lys Phe Gin Val 

1060 1065 1070 

Ala Gly Val Pro Glu Leu Asn Asn Leu Leu lie Gly Phe Asp Ala Asp 

1075 1080 1085 

Lys Asp Gly Gin Tyr Thr Leu Glu Phe Ala Leu Ser Asp His Phe Ala 

1090 1095 1100 

Lys Gly Ala Val Tyr Leu His Asp Leu Gin Ser Gly Ala Lys His Arg 
1105 1110 1115 1120 

lie Thr Asn Ser Thr Ser Tyr Ser Phe Asp Ala Lys Arg Gly Asp Ser 

1125 1130 1135 

Gly Ala Arg Phe Arg Leu Ser Tyr Gly Cys Asp Glu Asn Val Asp Asp 

1140 1145 1150 

Ser His Val Val Ser Thr Asn Gly Arg Glu lie lie lie Leu Asn Gin 
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1155 1160 1165 

Asp Ala Leu Asp Cys Thr Val Thr Leu Phe Thr He Glu Gly Lys Leu 

1170 1175 1180 

Leu Arg Arg Leu Lys Val Leu Ala Gly His Arg Glu Val Met Lys Val 
5 1185 1190 1195 1200 

Gin Thr Gly Gly Ala Tyr He Val His Leu Gin Asn Ala Phe Thr Asn 

1205 1210 1215 

Asp Val His Lys Val Leu Val Glu Tyr 

1220 1225 
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(2) INFORMATION FOR SEQ ID NO: 132 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 425 amino acids 
15 { B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

20 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

25 (ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1 . . . 425 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132 

30 

Met Glu Val Lys Lys Asn Thr Val Val Leu Arg Leu Leu He Trp Phe 
15 10 15 

Val Ala He Leu Leu Phe His Ser Ser Arg Leu Trp Gly Gin Glu Gly 
20 25 30 

35 Glu Gly Ser Ala Arg Tyr Arg Phe Lys Gly Phe Val Asp Thr Tyr His 
35 40 45 
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Ala Val Arg Ser Ser Ser Pro Phe Asp Phe Met Ser Ser Arg Thr Arg 

50 55 60 

Val Arg Gly Glu Leu Glu Arg Ser Phe Gly Asn Ser Lys Val Ala Val 
65 70 75 80 

Ser Val Asn Ala Thr Tyr Asn Ala Leu Leu Lys Asp Glu Thr Gly Leu 

85 90 95 

Arg Leu Arg Glu Ala Phe Phe Glu His Gin Glu Glu His Trp Gly Leu 

100 105 110 

Arg Leu Gly Arg Gin lie Val lie Trp Gly Ala Ala Asp Gly Val Arg 

115 120 125 

lie Thr Asp Leu lie Ser Pro Met Asp Met Thr Glu Phe Leu Ala Gin 

130 135 140 

Asp Tyr Asp Asp lie Arg Met Pro Val Asn Ala Leu Arg Phe Ser Val 
145 150 155 160 

Phe Asn Glu Ser Met Lys Val Glu Val Val Val Leu Pro Val Phe Glu 

165 170 175 

Gly Tyr Arg Leu Pro Val Asp Pro Arg Asn Pro Trp Asn lie Phe Ser 

180 185 190 

Leu Ser Pro lie Ala Gin Gly Met Asn lie Val Trp Lys Glu Glu Ala 

195 200 205 

Gly Lys Pro Ala Phe Lys Val Ala Asn lie Glu Tyr Gly Ala Arg Trp 

210 215 220 

Ser Thr Thr Leu Ser Gly lie Asp Phe Ala Leu Ala Ala Leu His Thr 
225 230 235 . - 240 

Trp Asn Lys Met Pro Val lie Glu Val Gin Gly lie Val Pro Thr Glu 

245 250 255 

lie lie Val Ser Pro Arg Tyr Tyr Arg Met Gly Phe Val Gly Gly Asp 

260 265 270 

Leu Ser Val Pro Val Gly Gin Phe Val Phe Arg Gly Glu Ala Ala Phe 

275 280 285 

Asn lie Asp Lys His Phe Thr Tyr Lys Ser His Ala Glu Gin Glu Gly 

290 295 300 

Phe Gin Thr lie Asn Trp Leu Ala Gly Ala Asp Trp Tyr Ala Pro Gly 
305 310 315 320 

Glu Trp Met lie Ser Gly Gin Phe Ser Met Glu Ser lie Phe Arg Tyr 
325 330 335 
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Arg Asp Phe He Ser Gin Arg Gin His Ser Thr Leu He Thr Leu Asn 

340 345 350 

Val Ser Lys Lys Phe Phe Gly Ser Thr Leu Gin Leu Ser Asp Phe Thr 

355 360 365 

Tyr Tyr Asp Leu Thr Gly Lys Gly Trp Phe Ser Arg Phe Ala Ala Asp 

370 375 380 

Tyr Ala Leu Asn Asp Gin He His Leu Met Ala Gly Tyr Asp Trp Phe 
385 390 395 400 

Ser Ser Lys Gly Ser Gly He Phe Asp Arg Tyr Lys Asp Asn Ser Glu 

405 410 415 

Leu Trp Phe Lys Ala Arg Tyr Ser Phe 
420 425 



(2) INFORMATION FOR SEQ ID NO: 133 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 211 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1. . .211 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133 

Met Ser He Arg Gin Lys He Arg Leu Phe His Leu Ser Val Cys Ala 

15 10 15 

Gin Thr His Asp His Leu He Glu He His Leu Val Cys He Glu Phe 
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20 25 30 

Gly Ala He Asp Thr Asp Glu Phe Arg Leu Ser Ser His Ala Tyr Thr 

35 40 45 

Thr Ser Pro Thr His Thr Gly Ala He His His Asn Cys He Glu Arg 

50 55 60 

Ser Tyr Gly Arg Tyr Leu Val Thr Phe Gly Gin Glu Arg Asn Glu Leu 
65 70 75 80 

His His His Ser Arg Pro Asp Arg Asn Ala Glu Val Tyr Arg Phe Pro 

85 90 95 

Phe Asp Asn Ala Phe His Ser He Arg Tyr Glu Ala Phe Arg Pro He 

100 105 HO 

Arg Pro He Val Cys His Asp Asp His Phe He Ala He Gly Ser His 

115 120 125 

Leu Phe Phe Lys Asp Asn Gin He Phe Ser Ser Gly Ser Gin Tyr Asp 

130 135 140 

Asn Tyr Thr Val Ala Cys Phe Val Glu Ser Leu His Asp Arg Glu Gin 
145 150 155 160 

Arg Ser His Thr His Thr Ala Ser Gly Thr Asn His Cys Ala Asp Leu 

165 170 175 

Phe Asp Met Arg Thr Leu Ser Gin Arg Thr Tyr His He Arg Asp He 

180 185 190 

Val Ala Asp Phe Glu Phe Gly Gin Phe Leu Gly Arg Phe Ala His Gly 

195 200 205 

Leu Asn His 
210 



(2) INFORMATION FOR SEQ ID NO: 134 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 60 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



( iii ) HYPOTHETICAL : YES 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY: misc_feature 

(B) LOCATION 1. . .260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134 

Met Lys Leu Leu Leu Tyr Leu Leu Leu Val Leu Ser Thr Leu Ser Pro 

1 5 10 15 

Met Tyr Ser Gin Met Leu Phe Ser Glu Asn Leu Thr Met Asn He Asp 

20 25 30 

Ser Thr Lys Thr He Gin Gly Thr He Leu Pro Val Leu Asp Phe Lys 

35 40 45 

Thr Glu Lys Glu Asn Val Phe Thr Phe Lys Asn Thr Ala Asn Leu Asn 

50 55 60 

Leu Leu He Lys His Gly Gin Val He Asn Leu He Asn Lys Leu Glu 
65 70 75 80 

Phe Ser Thr Tyr Gly Asn Lys Val Thr Val Ser Gly Gly Tyr Val His 

85 90 95 

Thr Glu Tyr Arg Tyr Leu Leu His His Val Phe Glu Val Tyr Pro Tyr 

100 105 - 1W 

Val Glu Ser Gin Trp Ala Glu Ser Arg Gly Met Lys Tyr Lys Val Ser 

115 120 125 

Thr Gly Leu Gin Ser Arg Tyr Arg Leu Val Asn Ser Asp Asn Cys Leu 

130 135 140 

Met Phe Ala Thr Leu Gly Val Phe Phe Glu Phe Glu Lys Trp Glu Gin 
145 150 155 160 

Pro Ala Thr Ser Leu Phe Ala Gly Thr Tyr Ala Tyr Ser Arg Ser He 

165 170 175 

Lys Ser His Leu Ser He Ser Phe Arg His Arg Leu Gly Glu His Trp 

180 185 190 

Glu Phe Thr Thr Thr Ala He His Gin Gly Lys Pro Asp Ser Tyr Phe 
195 200 205 
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Lys Lys Ala Arg Phe Gly Gly Ala lie Asp Leu Lys Tyr His lie Thr 

210 215 220 

Pro Thr He Gly He Arg Gly Ala Tyr Arg He He Tyr Asp Thr Ala 
225 230 235 240 

Pro He Val Pro Val Arg Lys Asp Tyr Asn Thr Val Asp Val Gly He 
245 250 255 

Asp He Ser Phe 
260 

(2) INFORMATION FOR SEQ ID NO: 135 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 315 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

( iii ) HYPOTHETICAL : YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : mis c_f eature 

(B) LOCATION 1 . . . 315 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135 

Met Lys Thr Asn Arg Arg Tyr Ala Phe Val Leu Pro Leu Leu Leu Leu 

15 10 15 

Thr Gly Leu Leu Ala Trp Gly Gin Asp Ser Ser His Gly Ser Asn Thr 

20 25 30 

Ala Phe Ala Thr Asp Ser Ser Ser Arg Glu Leu Pro Thr Glu Gin Ser 

35 40 45 

Ala Tyr Arg He His Ser Ala Tyr Met Val Gly Gly Gly Gly Ser He 
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50 

Thr Arg Asp Thr 
65 

Asn Leu Leu Gly 

Trp Met lie Arg 
100 

Pro Ala Asn Asn 
115 

Ala Ala Leu Tyr 
130 

Asp Asn Leu Arg 
145 

Gly lie Tyr Ser 

Tyr Thr Asn Ala 
180 

Glu Thr Phe Pro 
195 

Gly lie Ala Tyr 
210 

Leu Leu Asn Asn 
225 

Lys Phe Thr Arg 

Asn Phe Cys Thr 
260 

Ser Leu Asn Ala 
275 

Gly Phe Val Thr 
290 

Thr Gly Arg Arg 
305 



55 

Tyr Leu Ser Pro 
70 

Glu Lys Thr Phe 
85 

Thr Gly His Glu 

Ala His Phe Tyr 
120 

Arg Leu Gly Ala 
135 

Leu Ala Phe Gly 
150 

Thr Arg Asn Gly 
165 

lie Ala Gin Ala 

Leu Tyr Phe Arg 
200 

Gly Asn Gly Phe 
215 

Gly lie Ala Gly 
230 

Phe Thr Thr Leu 
245 

Leu Arg Val Gly 

Leu Asp Thr Arg 
280 

Glu Phe Tyr Arg 
295 

Thr Ser Leu Tyr 
310 



60 

Leu Arg Tyr Gly 
75 

Pro Leu Lys Ala 
90 

Leu Asp Phe Ala 
105 

Ser Leu Leu Tyr 

Lys His Leu Arg 
140 

Pro Gly Leu Glu 
155 

Asn Asn Pro Ala 
170 

Ser lie Gly Tyr 
185 

Leu Leu Ser Gin 

Gly Glu Ser Tyr 
220 

Ser Leu His Phe 
235 

lie Thr Ala Asp 
250 

Tyr Arg Tyr Ser 
265 

lie His Ser His 

Phe Arg Gly Arg 
300 

Tyr His Asp 
315 



Gly Trp Thr Leu 
80 

Ser Asp Ser Arg 
95 

Leu Met Asp Asn 
110 

Asn Gly Ser Ala 
125 

Ala Ala Trp Met 

He Gly Leu Gly 
160 

Thr Leu Lys Leu 
175 

Tyr Val Pro Ser 
190 

He Asn Leu Phe 
205 

Tyr Glu Asn Phe 

Thr Tyr Pro Gly 
240 

He Pro He Arg 
255 

His Leu Gly Ser 
270 

Thr Ala Phe He 
285 

Lys Ala Met Asn 



(2) INFORMATION FOR SEQ ID NO: 136 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 776 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : miscjeature 

(B) LOCATION 1 . . .77 6 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136 

Met Cys Lys lie Arg Phe Ser Leu Leu Gin Ala Leu Val Val Cys Leu 

15 10 15 

Leu Phe Thr Ser Phe Ser Leu Gin Ala Gin Glu Glu Gly lie Trp Asn 

20 25 30 

Thr Leu Leu Ala lie His Lys Thr Glu Lys Ala Val Glu Thr Pro Lys 

35 40 45- 

Lys Val Phe Ala Val Ala Asn Gly Val Leu Tyr Ser Val Gly Lys Glu 

50 55 60 

Ala Pro His Glu Ala Lys lie Phe Asp Arg lie Ser Gly Leu Ser Asp 
65 70 75 80 

Thr Ser Val Ser Ser lie Ala Tyr Ser Glu Gin Leu Lys Ser Leu Val 

85 90 95 

lie Tyr Tyr Ala Ser Gly Asn lie Asp lie Leu Asp Glu Ala Gly Arg 

100 105 HO 

Val Thr Asn Val Pro Ala Leu Lys Asp Asn lie Asp Leu lie Asp Lys 

115 120 125 

Thr Leu Asn Arg Leu Leu lie Val Gly Asn Arg Ala Tyr Leu Ala Gly 
130 135 140 
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Ala Glu Cys Asn Pro Gly Asp Ala Arg Val Lys Ala lie Ala Phe Asp 

435 440 445 

Asn Lys Gly Asn Leu Trp Gly Thr Leu Gly Ala Val Gly Lys Asn lie 

450 455 460 

Phe Met Tyr Asp Pro Gin Ser Ser Thr Trp His Ser Phe Ser Tyr Pro 
465 470 475 480 

Asp Val Ala Asn Leu Ala Ser Phe Gly Asn Met lie lie Leu Pro Asn 

485 490 495 

Gly Asp Lys Trp Val Asn lie Leu His Arg Ser Gly Gly Ser Thr Arg 

500 505 510 

Lys Gly Val Leu lie Phe Asn Asp Arg Gly Thr Pro Glu Thr Thr Ser 

515 520 525 

Asp Asp Ser His Leu Tyr Val Glu Gin Phe Val Asn Arg Leu Gly Ala 

530 535 540 

Ala He Gly His Lys Thr He Tyr Ala Met Ala Val Asp His Asn Gly 
545 550 555 560 

Ser Val Trp Met Gly Ser Asp He Gly He Phe Gly Val Tyr Asn Ala 

565 570 575 

Ala Gly Val Leu Ser Ser Thr Ser Thr Pro He Ala Val Arg Pro Val 

580 585 590 

Gly Gly Glu Glu Pro Asn Leu Tyr Tyr Val Leu Asp Lys Val Thr Val 

595 600 605 

Thr Asp He Val Val Asp Lys Leu Asn His Lys Trp Val Ala Thr Gin 

610 615 620 

Gly Thr Gly Leu Tyr Leu Leu Ser Glu Asp Cys Ser Lys He Leu Ala 
625 630 635 640 

Gin Phe Thr Val Glu Asn Ser Pro Leu Leu Ser Asn Asn He Leu Ser 

645 650 655 

Leu Ala Leu Asn Asp Asp Asn Gly Leu Leu Tyr He Gly Thr Ala Asp 

660 665 670 

Gly Leu Met Thr Phe Gin Thr Gly Thr Gly Ser Gly Ser Ala Ser Glu 

675 680 685 

Leu Asp Gly Val Tyr Val Tyr Pro Asn Pro Leu Arg Pro Glu Tyr Pro 

690 695 700 

Asp Gly Val Thr He Ala Gly Leu Gin Ala Gly Cys Ser Val Lys He 
705 710 715 720 
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Gly Phe Gly Leu Ser Val Leu Asp Val Ala Glu Ala Arg He Pro Ala 
145 150 155 160 

Thr Tyr Ala Lys Gly Thr Lys Val Thr Asp Val Ala Lys Leu Asp Asn 
165 170 175 

5 Asp Arg Leu Leu Met Leu Lys Glu Gly Gin Leu Phe He Gly Lys Glu 
180 185 190 

Thr Asp Asn Leu Gin Asp Pro Ala Ala Trp Thr Ala Leu Ser Leu Asn 

195 200 205 

Leu Pro Met Gly Ser Val Thr Gly Leu Gly He Val Gly Glu Asp He 
10 210 215 220 

Cys Phe Leu Leu Ala Asp Gly Arg Val Tyr Val Ala Ala Asn Gin Ser 
225 230 235 240 

Phe Glu Pro Glu Leu Leu Leu Ser Ser Ser Ala Asp Ser Arg Leu Tyr 
245 250 255 

15 Val Thr Asp Arg Gly Leu Phe He Cys Ala Glu Asn Arg He Tyr Phe 
260 265 270 

He Glu Lys Gly Arg Lys Thr Thr Gin Phe Pro He Ala Asp Val Leu 

275 280 285 

Gly Val Gly Ala Met Asn Glu Ser Asn Thr Ala Tyr He Ala Leu Gly 
20 290 295 300 

Glu Glu Gly Leu Ala Ser Leu Leu Leu Ala Glu Gly Ser Thr Ala Glu 
305 310 315 320 

Ala Met Pro Val Ala Phe Asp Gly Pro Gly Asp Asn Asp Phe Tyr Glu 
325 330 - * 335 

25 Met Arg Phe Ser His Gly Arg Leu Tyr Ala Ala Ser Gly Leu Trp Gly 
340 345 350 

Thr Asn Leu Met Gly His Ala Gly Met Val Lys Leu Tyr Asp Gly Asn 

355 360 365 

Arg Trp Thr Asn Phe Asp Lys Lys Thr Val Gin Glu Gin Leu Gly Gly 
30 370 375 380 

Gly Phe Ser Phe Asn Asp Ala He Asp He Ala Val Ser Asn Gly Asp 
385 390 395 400 

Pro Asp His Phe Phe Val Gly Thr Trp Gly Asn Gly Leu Phe Glu Phe 
405 410 415 

35 Lys Asp Gly Lys Ala He Ala Arg Tyr Ser Gly Asn Glu Thr Ala He 
420 425 430 



) 
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Thr Asp Thr Thr Gly Arg Leu Leu Tyr Gin Thr Glu Ser Val Thr Thr 

725 730 735 

Glu Val Lys Trp Asn Ala Arg Gly Ala Asp Gly Asn Arg Val Ala Ser 

740 745 750 

Gly Val Tyr Ala Val Ala Val Tyr Asp Pro Val Ser Lys Lys Ser Lys 

755 760 765 

Leu He Arg Phe Ala Val He Arg 
770 775 

(2) INFORMATION FOR SEQ ID NO: 137 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 444 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: mis cofeature 

(B) LOCATION 1. . .444 

(xi) SEQUENCE DESCRIPTION: SEQIDtfO:137 

Met Arg Phe Gin His Tyr Leu lie Cys Thr Ala Ala Val Ala, Ala Leu 

15 10 15 

Ala Ala Asn Pro Leu Thr Gly Gin Ser As* Met Thr Leu Glu Glu Cys 



20 



25 



30 



lie Asp Tyr Ala Arg Arg His Ser Ser Ala Val Ala Leu Ser Ala Ala 

35 40 45 

Glu Leu Glu Gin Ser Lys Ala Asp Tyr L# Gin Ala Val Gly Asn Phe 
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50 55 60 

Leu Pro Arg Val Set Ala Gly Thr Gly Ala Ser Trp Asn Phe Gly Arg 
65 , 70 75 80 

Gly Leu Asp Ala Glu Thr Asn Thr Tyr Thr Asp He Asn Ser Phe Asn 
5 85 90 95 

Asn Ser Tyr Ser He His Ala Thr Met Thr Leu Phe Asp Gly Leu Gin 

100 105 no 

Ser Val Tyr Arg Leu Arg Met Ala His Ala Arg Arg Glu Ala Ser Arg 
115 120 12 5 

10 Leu Ser Val Arg Glu Gin Gin Glu Leu Ala Ala Leu Gly Thr Thr Glu 
130 135 140 

Ala Tyr Tyr Asp Leu Val Tyr Ala Arg Gin Met Gin Glu Leu Ala Met 
145 150 155 160 

Gin Lys Tyr Glu Glu Ser Ser Arg Leu His Arg Gin Thr Ala Arg Met 
15 165 170 175 

Glu Glu Leu Gly Met Lys Ser Arg Pro Asp Val Leu Glu Met Gin Ser 

180 185 190 

Arg Met Ala Gly Asp Arg Leu Ala Leu Thr Gin Ala Asp Asn Gin Cys 
195 200 205 

20 He He Ala Leu He Arg Leu Lys Glu Lys Met Asn Phe Pro He Asp 
210 215 220 

Asp Glu Leu Val Val Asp Asp Met Pro Ala Asp Ser Leu Ser Ala Asp 
225 230 235 240 

Met Ala Glu Ser Asp Ser Ser Ala Gly Val Phe Ala Arg Alra Ala His 

245 250 255 

His His Pro Val Leu Leu Arg Ala Lys Leu Asp Glu Gin Ala Ala Thr 

260 265 270 

Asp Arg Leu Arg Ala Ala Arg Gly Ala Phe Leu Pro Ser Val Ser Val 



25 



275 



2 10 



285 



30 ser Gly Gly Trp Asn Thr Gly Pie Ser Arg Phe Leu Asn Gly Ser Asp 



290 



295 



300 



35 



Tyr Thr Pro Phe Ser Glu Gin Pie Arg Asn Arg Arg Gly Glu Tyr Val 
305 310 315 320 

Ser Leu Asn L 6U Ser He Pro I]» Phe Ser Gly Phe Ser Leu Val Ser 
\ 325 330 335 

4 Leu Arg Gl n Ala Arg Ala Gl. Arg Arg Ala Ala He Val Arg Arg 
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340 

Gly Glu Ala Glu Arg 
355 



350 

345 

c nu He Ala Gin Ala Met Ala 
Arg Leu Tyr Ser Glu He Aia 

365 



360 



Ma Ser Tyr Arg Gin Ala Lys Glu His Thr 
Asp Arg Asp Ala Ala Leu Ala Ser Ty ^ 

370 T rlu Ala Val Leu Gin Ar« Tyr Glu Glu 

ASP Ala Met Gin Thr Ala Tyr Glu Ala _ ^ 

385 390 

Gly Leu Asn Th 



395 



* Leu Thr Thr Gin Ala *sn Arg Leu Leu 
r Ala He Asp Leu Thr in ^ 

410 



405 



10 Asp Ala Arg Val Gin Arg 

42b 

420 

rlu rvs Leu Ser Asp 
Tyr Tyr Gly ^-i s 

440 



430 



Cys Lys Leu lie Ala 
435 



15 



